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MODULATORS OF TTSSTTF REGENERATION 

FIELD OF THE INVENTION 
The invention relates to proteins which are upregulated in injured or regenerating tissues, 
as well as to the DNA encoding these proteins. The invention further relates to therapeutic 
compositions and methods of treatment encompassing these proteins. 

BACKGROUND OF THF, INVENTION 
A dynamic remodeling of tissue architecture occurs during development and during 
tissue repair after injury. To study this process, we have focused on a model of kidney injury 
caused by an ischemia-reperfusion insult. 

The kidney is able to repair damage to the proximal tubule epithelium through a 
complex series of events involving cell death, proliferation of surviving proximal tubule 
epithelial cells, formation of poorly differentiated regenerative epithelium over the denuded 
basement membrane, and differentiation of the regenerative epithelium to form a fully 
functional proximal tubule epithelial cells (Wallin et al., Lab. Invest. 66:474-484, 1992; 
Witzgall et al., Mol. Cell. Biol. 13:1933-1942, 1994; Ichimura et al., Am. J. Physiol. 
269:F653-662, 1995; Thadhani et al., N. Engl. J. Med. 334:1448-1460, 1996). Growth 
factors such as IGF, EGF, and HGF have been implicated in this process of repair, as has the 
endothelial cell adhesion molecule ICAM-1 . However, the mechanisms by which the tubular 
epithelial cells are restored are still not understood. 

To identify molecules involved in process of injury and repair of the tubular 
epithelium, we analyzed the difference in the mRNA populations between injured/regenerating 
and normal kidneys using representational difference analysis (RDA). RDA is a PCR-based 
method for subtraction which yields target tissue or cell specific cDNA fragments by repetitive 
subtraction and amplification (Hubank and Schutz, Nucl. Acids Res. 22:5640-5648, 1994). 
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SUMMARY OF T H E I NVENT IO N 
The invention generally provides Kidney Injury-related Molecules (each of which is 
henceforth called a "KIM") which are upregulated in renal tissue after injury to the kidney. The 
KIM proteins and peptides of the invention, as well as their agonists and antagonists, and their 

S corresponding are useful in a variety of therapeutic interventions. 

The invention provides a purified and isolated DNA molecule having a nucleotide 
sequence set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. The 
invention also includes the complementary strands of these sequences, DNA molecules which 
hybridize under stringent conditions to the aforementioned DNA molecules, and DNA molecules 

10 which, but for the degeneracy of the genetic code, would hybridize to any of the DNA molecules 
defined above. These DNA molecules may be recombinant, and may be operably linked to an 
expression control sequence. 

The invention further provides a vector comprising a purified and isolated DNA molecule 
having a nucleotide sequence set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ 

IS ID NO:6, or one of the other DNA molecules defined above. This vector may be a biologically 
functional plasmid or viral DNA vector. One embodiment of the invention provides a 
prokaryotic or eukaryotic host cell stably transformed or transfected by a vector comprising a 
DNA molecule of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. In another 
embodiment of the invention, a process is provided for the production of a KIM polypeptide 

20 product encoded by a DNA molecule as described above; the process involves growing, under 
suitable culture conditions, prokaryotic or eukaryotic host cells transformed or transfected with 
the DNA molecule in a manner allowing expression of the DNA molecule, and recovering the 
polypeptide product of said expression. 

A purified and isolated human KIM protein substantially free of other human proteins is 

25 specifically within the invention, as is a process for the production of a polypeptide product 
having part or all of the primary structural conformation and the biological activity of a KIM 
protein. KIM proteins of the invention may have an amino acid sequence which comprises SEQ 
ID NO:3 , SEQ ID NO:5, or SEQ ID NO:7, or may be a variant of SEQ ID NO:3, SEQ ID NO:5 
or SEQ ID NO:7, or a purified and isolated protein encoded by the DNA of SEQ ID NO:l , SEQ 

30 ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. These proteins can be provided substantially free of 
other human proteins. The invention further includes variants of these proteins, such as soluble 
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variants or fusion proteins. KIM fusion proteins of the invention may comprise an 
immunoglobulin, a toxin, an imageable compound or a radionuclide. 

The invention also provides a specific monoclonal antibody to the KIM proteins 
described above. The anti-KIM antibody may be associated with a toxin, imageable compound 
5 or radionuclide. Further taught is a hybridoma cell line which produces such a specific antibody. 

Pharmaceutical compositions are also within the scope of the invention. A 
pharmaceutical composition of the invention may comprise a therapeutically effective amount of 
a KIM protein or anti-KIM antibody of the invention, along with a pharmacologically acceptable 
carrier. 

1 0 Diagnostic methods are within the invention, such as assessing the presence or course of 

resolution of renal injury by measuring the concentration of KIM in urine, serum, or urine 
sediment of patients who have or who are at risk of developing renal disease. 

Methods of treatment of the invention include treating patients with therapeutically 
effective amounts of KIM, KIM variants, KIM analogs, KIM fusion proteins, KIM agonists, and 

15 antibodies to KIM or to KIM ligands. Other therapeutic compounds of the invention include 
KIM ligands, anti-KIM antibodies, and fusions proteins of KIM ligands. These compounds can 
be useful in therapeutic methods which either stimulate or inhibit cellular responses that are 
dependent on KIM function. 

Further methods of the invention inhibit the growth of KIM-expressing tumor cells by 

20 contacting the cells with a fusion protein of a KIM ligand and either a toxin or radionuclide, or 
with an anti-KIM antibody conjugated to a toxin or to a radionuclide. Likewise, growth of tumor 
cells which express KIM ligand may be inhibited by contacting the cells with a fusion protein of 
a KIM and either a toxin or radionuclide, or with an anti-KIM ligand antibody conjugated to a 
toxin or to a radionuclide. 

25 The invention also encompasses methods of gene therapy. These include a method of 

treating a subject with a renal disorder, a method of promoting growth of new tissue in a subject, 
and a method of promoting survival of damaged tissue in a subject, comprising administering to 
the subject a vector which includes DNA comprising the nucleotide sequence of SEQ ID NO: 1 , 
SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. 

30 The compounds of the invention are also useful for imaging tissues, either in vitro or in 

vivo. One such method involves targeting an imageable compound to a cell expressing a protein 
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of SEQ ID NO:3, SEQ ID NO:5 or SEQ ID NO:7, comprising contacting the cell with either a 
monoclonal antibody of the invention or a fusion protein comprising a protein as described 
above, fused to an imageable compound. For in vivo methods, the cell is within a subject, and 
the protein or the monoclonal antibody is administered to the subject. 

5 The invention also includes diagnostic methods, such as a method of identifying damage 

or regeneration of renal cells in a subject, comprising comparing the level of expression of either 
SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 in renal cells of the subject to a 
control level of expression of the sequence in control renal cells. Another method of the 
invention includes identifying upregulation of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or 

1 0 SEQ ID NO:6 in cells comprising contacting the cells with an antisense probe and measuring 
hybridization to RNA within the cell. 

A further embodiment of the diagnostic methods of the invention includes assessing the 
presence or concentration of a molecule of the invention either in urine, serum, or other body 
fluids, or in urine sediment or tissue samples. The measured injury-related molecule can be 

1 5 correlated with the presence, extent or course of a pathologic process. This correlation can also 
be used to assess the efficacy of a therapeutic regime. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is the nucleotide sequence of rat clone cDNA 3-2, with putative protein reading frame 
of615tol535. 

20 FIGURE 2 is a listing of the cDNA sequence of rat clone 1-7, with putative protein reading 
frame of 145 to 1065. 

FIGURE 3 is a listing of the cDNA sequence of rat clone 4-7, with putative protein reading 
frame of 107 to 1822. 

FIGURE 4 is a listing of the cDNA and deduced amino acid sequences of human clone HI3-10- 
25 85, with putative protein reading frame of 1 to 1002. The upper line of the listing is the cDNA 
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sequence (SEQ ID NO:6), and the lower line is the deduced amino acid sequence (SEQ ID 
N0:7). 

FIGURE 5 is a BESTFIT comparison of the nucleotide sequence of human clone HI3- 10-85 with 
that of rat clone 3-2. 

DETAILED DESCRIPTION PIT THE TNVFNTION 

We identified KIM genes by analyzing differences in mRNA expression between 
regenerating and normal kidneys using representational difference analysis (RDA). RDA is a 
PCR-based method for subtraction which yields target tissue or cell-specific cDNA fragments by 
repetitive subtraction and amplification. The cDNA representation from 48 hr postishemic 
adult rat kidney RNA is subtracted with the sample from normal (sham-operated) adult rat 
kidney. In this procedure, sequences which are common to both postishemic and to normal 
kidney samples are removed, leaving those sequences which are significantly expressed only in 
the injured kidney tissue. Such genes encode proteins that may be therapeutically beneficial for 
renal disorders or involved in the injury process. Several clones have been obtained, sequenced 
and characterized. The clones are then investigated for their expression patterns during kidney 
repair, development and tissue distribution by northern analysis and RNA in situ hybridization. 

Sequence Identification Numh^ 

Nucleotide and amino acid sequences referred to in the specification have been given the 
following sequence identification numbers: 

SEQ ID NO: 1 - nucleotide sequence of rat 3-2 cDNA insert 
SEQ ID NO:2 - nucleotide sequence of rat 1-7 cDNA insert 

SEQ ID NO:3 - amino acid sequence of rat KIM-1, encoded by rat 3-2 and 1-7 cDNA's 

SEQ ID NO:4 - nucleotide sequence of rat 4-7 cDNA insert 

SEQ ID NO:5 - amino acid sequence encoded by 4-7 cDNA insert 

SEQ ID NO:6 - nucleotide sequence of human cDNA clone HI 3- 10-85 

SEQ ID NO:7 - amino acid sequence encoded by human cDNA clone HI 3-10-85 
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Definitions of Terms 

A "KIM protein", herein used synonymously with "KIM", is a protein encoded by 
mRN A which is selectively upregulated following injury to a kidney. One group of KIM 
proteins of interest includes those coded for by mRNA which is selectively upregulated at any 

S time within one week following any insult which results in injury to renal tissue. Examples of 
times at which such upregulation might be identified include 10 hours, 24 hours, 48 hours or 96 
hours following an insult. Examples of types of insults include those resulting in ischemic, toxic 
or other types of injury. 

A "KIM agonist" is a molecule which can specifically trigger a cellular response 

10 normally triggered by the interaction of KIM with a KIM ligand. A KIM agonist can be a KIM 
variant, or a specific antibody to KIM, or a soluble form of the KIM ligand. 

A "KIM antagonist" is a molecule which can specifically associate with a KIM ligand or 
KIM, thereby blocking or otherwise inhibiting KIM binding to the KIM ligand. The antagonist 
binding blocks or inhibits cellular responses which would otherwise be triggered by ligation of 

1 5 the KIM ligand with KIM or with a KIM agonist. Examples of KIM antagonists include certain 
KIM variants, KIM fusion proteins and specific antibodies to a KIM ligand or KIM. 

A "KIM ligand" is any molecule which noncovalently and specifically binds to a KIM 
protein. Such a ligand can be a protein, peptide, steroid, antibody, amino acid derivative, or other 
type molecule, in any form, including naturally-occurring, recombinantly produced, or otherwise 

20 synthetic. A KIM ligand can be in any form, including soluble, membrane-bound, or part of a 
fusion construct with immunoglobulin, fatty acid, or other moieties. The KIM ligand may be an 
integrin. A membrane-bound KIM ligand can act as a receptor which, when bound to or 
associated with KIM, triggers a cellular response. In some interactions, KIM may associate with 
more than a single KIM ligand, or may associate with a KIM ligand as part of a complex with 

25 one or more other molecules or cofactors. In a situation where both the KIM and the KIM 

ligand are bound to cell membranes, the KIM may associate and react with KIM ligand which is 
bound to the same cell as the KIM, or it may associate and react with KIM ligand be bound to a 
second cell. Where the KIM ligation occurs between molecules bound to different cells, the two 
cells may be the same or different with respect to cellular type or origin, phenotypic or metabolic 

30 condition, or type or degree of cellular response (e.g., growth, differentiation or apoptosis) to a 
given stimulus. "KIM ligation" refers to the contact and binding of KIM with a KIM ligand. 
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By "alignment of sequences" is meant the positioning of one sequence, either nucleotide 
or amino acid, with that of another, to allow a comparison of the sequence of relevant portions of 
one with that of the other. An example of one method of this procedure is given in Needleman et 
al (J. Mol. Biol. 48:443-453, 1970). The method may be implemented conveniently by 
5 computer programs such as the Align program (DNAstar, Inc.). As will be understood by those 
skilled in the art, homologous or functionally equivalent sequences include functionally 
equivalent arrangements of the cysteine residues within the conserved cysteine skeleton, 
including amino acid insertions or deletions which alter the linear ammgement of these cysteines, 
but do not materially impair their relationship in the folded structure of the protein. Therefore, 
1 0 internal gaps and amino acid insertions in the candidate sequence are ignored for purposes of 
calculating the level of amino acid sequence homology or identity between the candidate and 
reference sequences. One characteristic frequently used in establishing the homology of proteins 
is the similarity of the number and location of the cysteine residues between one protein and 
another. 

15 "Antisense DNA" refers to the sequence of chromosomal DNA that is transcribed. 

An "antisense probe" is a probe which comprises at least a portion of the antisense DNA 
for a nucleic acid portion of interest. 

By "cloning" is meant the use of in vitro recombination techniques to insert a particular 
gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
20 gene, it is necessary to employ methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it can 
replicate, and for selecting the clone having the target gene from amongst the recipient host cells. 

By "cDNA" is meant complementary or copy DNA produced from an RNA template by 
the action of RNA-dependent DNA polymerase (reverse transcriptase). Thus a "cDNA clone" 
25 means a duplex DNA sequence complementary to an RNA molecule of interest, carried in a 
cloning vector. 

By "cDNA library" is meant a collection of recombinant DNA molecules containing 
cDNA inserts which together comprise a representation of the mRNA molecules present in an 
entire organism or tissue, depending on the source of the RNA templates. Such a cDNA library 
30 may be prepared by methods known to those of skill, and described, for example, in Maniatis et 
al., Molecular Cloning: A Laboratory Manual, fiypia. Generally, RNA is first isolated from the 
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cells of an organism from whose genome it is desired to clone a particular gene. Preferred for the 
purposes of the present invention are mammalian, and particularly human, cell lines. 
Alternatively, RNA may be isolated from a tumor cell, derived from an animal tumor, and 
preferably from a human tumor. Thus, a library may be prepared from, for example, a human 

5 adrenal tumor, but any tumor may be used. 

As used herein, the term "DNA polymorphism" refers to the condition in which two or 
more different nucleotide sequences can exist at a particular site in DNA. 

"Expression vector" includes vectors which are capable of expressing DNA sequences 
contained therein, i.e., the coding sequences are operably linked to other sequences capable of 

10 effecting their expression. It is implied, although not always explicitly stated, that these 

expression vectors must be replicable in the host organisms either as episomes or as an integral 
part of the chromosomal DNA. A useful, but not a necessary, element of an effective expression 
vector is a marker encoding sequence, which is a sequence encoding a protein which results in a 
phenotypic property (such as tetracycline resistance) of the cells containing the protein which 

15 permits those cells to be readily identified. In sum, "expression vector" is given a functional 
definition, and any DNA sequence which is capable of effecting expression of a specified 
contained DNA code is included in this term, as it is applied to the specified sequence. As at 
present, such vectors are frequently in the form of plasmids, thus "plasmid" and "expression 
vector" are often used interchangeably. However, the invention is intended to include such other 

20 forms of expression vectors which serve equivalent functions and which may, from time to time 
become known in the art. 

By "functional derivative" is meant the "fragments", "variants", "analogs", or "chemical 
derivatives" of a molecule. A "fragment" of a molecule, such as any of the antigens of the 
present invention is meant to refer to any polypeptide subset of the molecule. A "variant" of 

25 such molecules is meant to refer to a naturally occurring molecule substantially similar to either 
the entire molecule, or a fragment thereof. An "analog" of a molecule is meant to refer to a 
non-natural molecule substantially similar to either the entire molecule or a fragment thereof. 
The term "gene" means a polynucleotide sequence encoding a peptide. 
By "homogeneous" is meant, when referring to a peptide or DNA sequence, that the 

30 primary molecular structure (i.e., the sequence of amino acids or nucleotides) of substantially all 
molecules present in the composition under consideration is identical. 
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"Isolated" refers to a protein of the present invention, or any gene encoding any such 
protein, which is essentially free of other proteins or genes, respectively, or of other 
contaminants with which it might normally be found in nature, and as such exists in a form not 
found in nature. 

5 The term "label" refers to a molecular moiety capable of detection including, by way of 

example, without limitation, radioactive isotopes, enzymes, luminescent agents, and dyes. 

The term "probe" refers to a ligand of known qualities capable of selectively binding to a 
target antiligand. As applied to nucleic acids, the term "probe" refers to a strand of nucleic acid 
having a base sequence complementary to a target strand. 

1 0 "Recombinant host cells" refers to cells which have been transformed with vectors 

constructed using recombinant DNA techniques. As defined herein, the antibody or modification 
thereof produced by a recombinant host cell is by virtue of this transformation, rather than in 
such lesser amounts, or more commonly, in such less than detectable amounts, as would be 
produced by the untransformed host. 

1 5 By "substantially pure" is meant any protein of the present invention, or any gene 

encoding any such protein, which is essentially free of other proteins or genes, respectively, or of 
other contaminants with which it might normally be found in nature, and as such exists in a form 
not found in nature. 

A molecule is said to be "substantially similar" to another molecule if the sequence of 
20 amino acids in both molecules is substantially the same, and if both molecules possess a similar 
biological activity. Thus, provided that two molecules possess a similar activity, they are 
considered variants as that term is used herein even if one of the molecules contains additional 
amino acid residues not found in the other, or if the sequence of amino acid residues is not 
identical. As used herein, a molecule is said to be a "chemical derivative" of another molecule 
25 when it contains additional chemical moieties not normally a part of the molecule. Such 
moieties may improve the molecule's solubility, absorption, biological half life, etc. The 
moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, etc. Moieties capable of mediating such effects are 
disclosed, for example, in Remington's Pharmaceutical Sciences. 16th ed Mack Publishing Co., 
30 Easton,Penn.(1980). 
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By "vector" is meant a DNA molecule, derived from a plasmid or bacteriophage, into 
which fragments of DNA may be inserted or cloned. A vector will contain one or more unique 
restriction sites, and may be capable of autonomous replication in a defined host or vehicle 
organism such that the cloned sequence is reproducible. 

S Compounds of the Invention 

The invention includes the cDNA of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:4 or 
SEQ ID NO:6, as well as sequences which include the sequence of SEQ ID NO: 1, SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:6, and derivatives of these sequences. The invention also 
includes vectors, liposomes and other carrier vehicles which encompass these sequence or 

10 derivatives of them. The invention further includes proteins transcribed from SEQ ID NO:l, 

SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, including but not limited to SEQ ID NO:3, SEQ 
ID NO:5 or SEQ ID NO:7, and their derivatives and variants. 

One embodiment of the invention includes soluble variants of a KIM protein that is 
usually synthesized as a membrane associated protein, and which is upregulated after injury. 

1 5 Soluble variants lack at least a portion of the transmembrane or intra-membrane section of a 
native KIM protein. In some examples, the soluble variant lacks the entire transmembrane or 
intra-membrane section of a native KIM protein. Soluble variants include fusion proteins which 
encompass derivatives of KIM proteins that lack at least a portion of the transmembrane or intra- 
membrane section of a native KIM protein. All types of KIM fusion proteins are included, 

20 particularly those which incorporate his-tag, Ig-tag, and myc-tag forms of the molecule. These 
KIM fusions may have characteristics which are therapeutically advantageous, such as the 
increased half-life conferred by the Ig-tag. Also included are fusion proteins which incorporate 
portions of selected domains of the KIM protein. 

Variants can differ from naturally occurring KIM protein in amino acid sequence or in 

25 ways that do not involve sequence, or both. Variants in amino acid sequence are produced when 
one or more amino acids in naturally occurring KIM protein is substituted with a different natural 
amino acid, an amino acid derivative or non-native amino acid. Particularly preferred variants 
include naturally occurring KIM protein, or biologically active fragments of naturally occurring 
KIM protein, whose sequences differ from the wild type sequence by one or more conservative 

30 amino acid substitutions, which typically have minimal influence on the secondary structure and 
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hydrophobic nature of the protein or peptide. Variants may also have sequences which differ by 
one or more non-conservative amino acid substitutions, deletions or insertions which do not 
abolish the KIM protein biological activity. Conservative substitutions typically include the 
substitution of one amino acid for another with similar characteristics such as substitutions 
5 within the following groups: valine, glycine; glycine, alanine; valine, isoleucine; aspartic acid, 
glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, 
tyrosine. The non-polar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine. The positively charged 
1 0 (basic) amino acids include arginine, lysine and histidine. The negatively charged (acidic) amino 
acids include aspartic acid and glutamic acid. 

Other conservative substitutions can be taken from the table below, and yet others are 
described by Dayhoff in the Atlas of Protein Sequence and Structure (1988). 
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TABLE 1: CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly,beta-Ala, L- 
Cys,D-Cys 


Arginine 


R 


D-Arg, Lys,homo-Arg, D- 
homo-Arg, Met,D-Met, He, 

L/-11C, Willi U m vJni 


Asparagine 


N 


D-Asn,Asp,D-Asp,GIu,D- 
uiu, oin,i-/-vjin 


Aspartic Acid 


D 


D-Asp,D-Asn,Asn, Giu,D- 
uiu, vjin, u-uin 


Cysteine 


C 


D-Cys, S-Me-Cys,Met,D- 
iviei, i nr, u- 1 nr 


Glutamine 


Q 


D-Gln,Asn, D-Asn,Glu,D- 

\JlU,/YSp, li-ASp 


Glutamic Acid 


E 


D-Glu,D-Asp,Asp, Asn, D- 
Asn, Gin, D-Gln 


Glycine 


G 


Ala, D-Ala,Pro, D-Pro, Beta- 
Ala, Acp 


Isoleucine 


I 


D-Ile, Val, D-Val, Leu, D- 
Leu, Met, D-Met 


Leucine 


L 


D-Leu, Val, D-Val, Met, D- 
Met 


Lysine 


K 


D-Lys,Arg, D-Arg, homo- 
Arg, D-homo-Arg, Met, 
D_Met, He, D-Ile, Om, D- 
Orn 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, 
Leu, D-Leu, Val, D-Val, 
Norleu 


Phenylalanine 


F 


D-Phe,Tyr, D-Thr,L- 
Dopa,His,D-His, Trp, D-Trp, 
Trans 3,4 or 5-phenylproline, 
cis 3,4 or 5 phenylproline 
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Proline 


P 


D-Pro, L-I-thioazolidine-4- 
carboxylic acid, D- or L-l- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser. Thr D-Thr allo-Thr 

*^ till) •»-' iiU) diiir^ l m | 

Met, D-Met, Met(0), D- 
Met(0),Val, D-Val 


Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, 
Met, D-Met, Met)0, D- 
Met(O), Val, D-Val 


Tyrosine 


Y 


D-Tyr,Phe, D-Phe, L-Dopa, 
His,D-His 


Valine 


V 


D-Val, Leu,D-Leu,Ile,D-Ile, 
Met, D-Met 
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Other variants within the invention are those with modifications which increase peptide 
stability. Such variants may contain, for example, one or more non-peptide bonds (which replace 
the peptide bonds) in the peptide sequence. Also included are: variants that include residues 
other than naturally occurring L-amino acids, such as D-amino acids or non-naturally occurring 

5 or synthetic amino acids such as beta or gamma amino acids and cyclic variants. Incorporation 
of D- instead of L-amino acids into the polypeptide may increase its resistance to proteases. See, 
e.g., U.S. Patent 5,219,990. 

Generally, substitutions that may be expected to induce changes in the functional 
properties of KIM polypeptides are those in which: (I) a hydrophilic residue, e.g., serine or 

10 threonine, is substituted by a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, or 
alanine; (ii) a cysteine residue is substituted for (or by) any other residue; (iii) a residue having 
an electropositive side chain, e.g., lysine, arginine or histidine, is substituted for (or by) a residue 
having an electronegative charge, e.g., glutamic acid or aspartic acid; or (iv) a residue having a 
bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having such a side chain, 

15 e.g., glycine. 

The peptides of this invention may also be modified by various changes such as 
insertions, deletions and substitutions, either conservative or nonconservative where such 
changes might provide for certain advantages in their use. Splice variants are specifically 
included in the invention. 

20 In other embodiments, variants with amino acid substitutions which are less conservative 

may also result in desired derivatives, e.g., by causing changes in charge, conformation and other 
biological properties. Such substitutions would include for example, substitution of hydrophilic 
residue for a hydrophobic residue, substitution of a cysteine or proline for another residue, 
substitution of a residue having a small side chain for a residue having a bulky side chain or 

25 substitution of a residue having a net positive charge for a residue having a net negative charge. 
When the result of a given substitution cannot be predicted with certainty, the derivatives may be 
readily assayed according to the methods disclosed herein to determine the presence or absence 
of the desired characteristics. 

Variants within the scope of the invention include proteins and peptides with amino acid 

30 sequences having at least eighty percent homology with a KIM protein. More preferably the 
sequence homology is at least ninety percent, or at least ninety-five percent For the purposes 
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of determining homology the length of comparison sequences will generally be at least 8 amino 
acid residues, usually at least 20 amino acid residues. Variants of the compounds of the 
invention also includes any protein which 1) has an amino acid sequence which is at least forty 
percent homologous to a KIM protein of the invention, and also which 2) after being placed in an 
5 optimal alignment with the KIM sequence (as depicted in Figure 5 for human and for rat KIM-1 ) 
has at least 80% of its cysteine residues aligned with cysteines in the KIM protein of the 
invention. 

Just as it is possible to replace substituents of the scaffold, it is also possible to substitute 
functional groups which are bound to the scaffold with groups characterized by similar features. 

10 These substitutions will initially be conservative, i.e., the replacement group will have 

approximately the same size, shape, hydrophobicity and charge as the original group. Non- 
sequence modifications may include, for example, in vivo or in vitro chemical derivatization of 
portions of naturally occurring KIM protein, as well as changes in acetylation, methylation, 
phosphorylation, carboxylation or glycosylation. 

15 Also included within the invention are agents which specifically bind to the protein, or a 

fragment of the protein (SEQ ID NO:3, 5 or 7). These agents include ligands and antibodies 
(including monoclonal, single chain, double chain, Fab fragments, and others, whether native, 
human, humanized, primatized, or chimeric). Additional descriptions of these categories of 
agents are in PCT application 95/16709, the specification of which is herein incorporated by 

20 reference. 

Experimental Procedure* 

1- Generation of RNA from ischemic and normal rat adult kidnevs 

Ischemic injured rat kidneys are generated as described by Witzgall et al. (J. Clin Invest. 
25 93: 2 1 75-2 1 88, 1 994). Briefly, the renal artery and vein from one kidney of an adult Sprague- 
Dawley rat are clamped for 40 minutes and then reperfiised. Injured kidneys are harvested from 
the rats at 24 hours and at 48 hours after reperfusion. Kidneys from sham-operated, normal 
adult Sprague-Dawley rats are also harvested. 

Total RNA is prepared from the organs based on the protocol by Glisin et al. 
30 (Biochemistry 13: 2633, 1974). Briefly, the harvested organs are placed immediately into GNC 
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buffer (4M guanidine thiocyanate, 0.5% SDS, 25mM sodium citrate, 0.1% Sigma anti foam) and 
disrupted on ice with a polytron. Cell debris is removed with a low speed spin in a clinical 
centrifuge and the supernatant fluid is placed on a 5.7 M CsCl, 25mM sodium acetate, ImM 
EDTA cushion. RNA is pelleted through the cushion in a S W40Ti rotor at 22K for 1 5hrs. RNA 
5 is resuspended in sterile DEPC- treated water, precipitated twice with 1/10 volume 3M sodium 
acetate and 2.5 volumes of EtOH. Poly A+ RNA is isolated using an mKNA purification kit 
(Pharmacia, catalog No.27-9258-02). 

2. Representational Difference Analysis CRD A) method to isolate 1-7. 3-2 and 4-7 RDA 

fragments 

1 0 Double stranded cDNA is synthesized from sham-operated and from 48hr post-i schemic 

kidney poly A+ RNA using Gibco BRL "Superscript Choice™ System cDNA Synthesis Kit" , 
catalog No. 18090. First strand is synthesized by priming with oligo dT and using Superscript 
II™ reverse transcriptase. Second strand is generated using E. cbli DNA polymerase I and 
RNase H followed by T4 DNA polymerase using BRL recommended conditions. 

1 5 RDA analysis is performed essentially as described by Hubank and Schatz (Nucleic Acid 

Research 22: 5640-48, 1994). Briefly, 48 hr post-ischemic kidney cDNA is digested with the 
restriction enzyme Dpn II, and ligated to R-Bgl-12/ 24 oligonucleotides (see reference for exact 
sequence). PCR amplification (performed with Perkin-Elmer Taq polymerase and their 
corresponding PCR buffer) of the linker ligated cDNA is used to generate the initial 

20 representation. This PCR product is designated "tester amplicon." The same procedure is used 
to generate "driver amplicon" from sham-operated rat kidney cDNA. 

Hybridization of tester and driver amplicons followed by selective amplification are 
performed three times to generate Differential Product One (DPI), Two (DP2) and Three (DP3). 
Generation of the DPI product is performed as described by Hubank and Schatz (Nucleic Acid 

25 Research 22: 5640-48, 1994). The DP2 and DP3 products are also generated as described by 
Hubank and Schatz (id.), except that the driventester ratios are changed to 5,333: 1 for DP2 and 
to 40,000 :1 or 4,000:1 for DP3. 

Three RDA products are cloned from DP3 into the cloning vector pUC 18: RDA product 
1 -7 (252bp) when the DP3 was generated using a ratio of 40,000: 1, and product RDA 3-2 

30 (445bp) and 4-7 (483bp) when the DP3 was generated using a ratio of 4,000: 1 . The DNA 
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fragments are subcloned using the Pharmacia Sureclone™ kit (catalog No. 27-9300-01) to repair 
the ends of the PCR fragments with Klenow enzyme and to facilitate blunt end ligation of the 
fragments into the pUC 1 8 vector. 

3. Northern Analysis 

Poly A+ RNA (2.5ug) from rat normal adult kidney (sham operated) , from 48hr post- 
ishemic injured adult kidney, and from day 18 embryonic kidney is electrophoresed and 
Northern blotted (Cate, Cell 45:685, 1 986) to a GeneScreen™ membrane (Dupont). 
Hybridization in PSB buffer (50mM Tris 7.5, 1M NaCl , 0.1% Na pyrophosphate, 0.2% PVP, 
0.2% Ficoll, 0.2% BSA, 1% SDS), containing 10% dextran sulphate and lOOug/ml tRNA, is 
performed at 65C using three different probes: 1-7 RDA product, 3-2 RDA product and 4-7 RDA 
product. All are radiolabeled using Pharmacia's "Ready to Go™" random priming labeling kit 
(catalog No.27-9251-01). RDA products 1-7, 3-2 and 4-7 hybridize to mRNAs present in all 
three samples, but most intensely to mRNAs in the 48hr post-ischemic kidney RNA samples. 

A Northern blot analysis of adult rat tissues indicates that the 1 -7 gene is expressed at 
very low levels in normal adult kidney, testis, spleen and lung. The 3-2 gene is expressed in 
liver, kidney, spleen, and brain. The 4-7 gene is expressed in spleen, kidney, lung, testis, heart , 
brain, liver, and skeletal muscle. The presence of different sized mRNAs in some tissues in the 
1-7 and 3-2 blot indicates that the primary transcription product of the 1-7 gene and of the 3-2 
gene may undergo alternate splicing and/or polyadenylation. 

4. Isolation of 3-2 and 4-7 cDNA clones 

A cDNA library is generated from 4 ug of polyA+ RNA from 48hr post-ischemic injured 
kidney using reagents from BRL Superscript Choice™ System for cDNA synthesis, and 
Stratagene™ Lambda ZapII cloning kit (catalog No. 236201), according to protocols 
recommended by the manufacturers. 

10 s clones are screened with the 3-2 RDA product as a probe (random primed labeled as 
described above). Eight positive clones are selected and four are randomly chosen for secondary 
analysis to obtain pure phage plaques. After tertiary screening, four pure phage clones are 
isolated. Cloned inserts from the phage are isolated by in vivo excision procedure according to 
Stratagene™ Lambda Zap II kit The largest insert, of approximately 2.6 kb (referred to as 
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cDNA clone 3-2), is subjected to DNA sequencing. The sequence of the insert (SEQ ID NO: 1 ) 
is shown in Figure 1. cDNA clone 3-2 (K coli K-12, SOLR/p3-2#5-l) has been deposited as 
ATCC No. 98061 . The sequence of cDNA clone 3-2 is identical to that of clone 1-7 cDNA 
(SEQ ID NO: 2), except that nucleotides 136-605 of SEQ ID NO:l represent an insertion. Thus, 

5 SEQ ID NO:2 represents a splice variant form of SEQ ID NO: 1 . The clone for 1-7 (£. coli K-12, 
SOLR/pl-7#3-l) has been deposited as ATCC No. 98060. 

10 3 clones are screened with the 1-7 RDA product as a probe (random primed 
radiolabeled as described above). Eight positive clones are selected and four are randomly 
chosen for secondary analysis to obtain pure phage plaques. After tertiary screening, four pure 

10 phage clones are isolated. Cloned inserts from the phage are isolated by in vivo excision 

procedure according to Stratagene™ Lambda Zap II kit. The largest insert of approximately 2.0 
kb (referred to as cDNA clone 1-7) is subjected to DNA sequencing; the sequence of the insert 
(SEQ ID NO: 2) is shown in Figure 2. 

10 3 clones are screened with the 4-7 RDA product as a probe (random primed labeled as 

1 5 described above and hybridized in PSB at 65C). Eight positive clones are selected and four are 
randomly chosen for secondary analysis to obtain pure phage plaques. After secondary 
screening, two pure phage clones are isolated. Cloned inserts from the phage are isolated by in 
vivo excision procedure according to Stratagene™ Lambda Zap II kit. The largest insert, 
approximately 2.4 kb (referred to as cDNA clone 4-7), is subjected to DNA sequencing. The 

20 sequence of the insert, SEQ ID NO: 4, is shown in Figure 3. The cDNA clone 4-7 (£. coli K-12, 
SOLR/p4-7#l-l) has been deposited as ATCC No. 98062.. 

5, Charactmzation of the 1-7. 3-2 and 4-7 cDNA clones 
A.) DNA and Protein Sequences: 

The sequence of 3-2 cDNA (Figure 1 ; SEQ ID NO:l) contains an open reading frame of 
25 307 amino acids (Figure 1 ; SEQ ID NO:3). A signal sequence of 21 amino acids is inferred from 
Von Heijne analysis (Von Heijne et al., Nucl. Acid Res. 14:14683 (1986)), and a transmembrane 
region spanning approximately aa 235-257 indicates that the 3-2 product is a cell surface protein. 

The sequence of 1-7 cDNA (Figure 2; SEQ ID NO:2) contains an open reading frame of 
307 amino acids, which is identical to the open reading frame contained in the 3-2 cDNA (SEQ 
30 ID NO: 3) . The sequence of 4-7 cDNA (Figure 3; SEQ ID NO:4) contains an open reading 
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frame of 572 amino acids (SEQ ID NO:5). A transmembrane region is located at approximately 
amino acids 501-521. 

B.) In situ analysis of 1-7, 3-2 and 4-7 mRNAs in contralateral and in post-ischemic adult 
rat kidneys: 

5 In situ hybridization is carried out according to the method described by Finch et al., Dev. 

Dynamics 203: 223-240, 1995. Briefly, both ischemic and contralateral kidneys are perfusion 
fixed with 4% paraformaldehyde in PBS. Kidneys are further fixed overnight at 4C and 
processed. Paraffin sections are deparaffinized and rehydrated, fixed with 4% paraformaldehyde 
in PBS, digested with proteinase K, refixed, then acetylated with acetic anhydride in 

1 0 triethanolamine buffer. Sections are then dehydrated and hybridized with 32 P-labeled riboprobes 
at 55 °C overnight, with 33P-labeled riboprobes generated from 3-2 RDA or 1-7 RDA products 
subcloned into BamHl site of pGEM-1 1Z. After hybridization, sections were washed under 
high stringency conditions (2 X SSC, 50 % formamide at 65°C). Sections are finally dehydrated, 
emulsion (NBT-2) coated for autoradiography, and exposed for at least a week. Silver grains are 

1 5 developed and sections are counterstained with toluidine blue and microphotographed. 

Analysis of 1-7 and 3-2 mRNA expression by in situ hybridization indicates that 
these genes are greatly upregulated in damaged kidney cells compared to their 
expression in normal kidney sections. The expression seen is in regenerative cells of the cortex 
and outer medulla, most of which appear to be proximal tubule cells. 

20 Analysis of the 4-7 in situ RNA expression pattern also reveals abundant expression of 

this gene in the injured ischemic kidney compared to the normal adult kidney. The site of 
expression appears to be infiltrating cells. 

Isolation of a human cDNA clone which cross hybridizes to the rat 3-2 cDNA 

A 32 P-labeled DNA probe comprising nucleotides 546-969 of the insert of clone 3-2 
25 shown in Figure 1 is generated and used to screen a human embryonic liver lambda gtlO cDNA 
library (Clontech Catalog #HL5003a). 1 XI 0 6 plaques are screened in duplicate using standard 
conditions as described above but temperature for screening was 55C For the high stringency 
wash, the filters are washed in 2X SSC at 55C. Fifty positive phage are identified and plaque 
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purified, and DNA is prepared. The phage DNAs are subjected to Southern analysis using the 
same probe as above. The Southern blot filter is subjected to a final wash with 0.5X SSC at 55C. 
Two clones are identified as positive. The insert of clone HI 3-1 0-85 is sequenced and a region is 
found that encodes a protein with a high level of identity to the 3-2 protein shown in Figure 3. 
5 The nucleotide sequence (SEQ ID NO:6) and predicted amino acid sequence (SEQ ID 

NO:7) of the human 3-2 related protein are shown in Figure 4. As shown by the bestfit analysis 
depicted in Figure 5, the human 3-2 related protein is 43.8% identical and 59.1% similar to the 
rat 3-2 protein. Both contain IgG, mucin, transmembrane, and cytoplasmic domains. The six 
cysteines within the IgG domains of both proteins are conserved. 

10 7) Production of KIM- 1 Ig fasion protein 

A fusion protein of the extracellular domain of KIM and the Fc region of 
immunoglobulin (Ig) is a useful tool for the study of the molecular and cellular biology of the 
injured/regenerating kidney and as a therapeutic molecule. To produce Kim Ig fusion protein 
with the extracellular domain of human and rat KIM-1 protein, a fragment of the extracellular 

1 5 domain of KIM-1 cDN A was amplified by PGR and cloned in the Biogen expression vector, 
pCA125, for transient expression in COS cells. The expression vector pCA125 produces a fusion 
protein which has a structure from gene cloned at N-terminus and a human Ig Fc region at the C- 
terminus. COS cells were transfected with the plasmids SJR 103 or 104; these plasmids express a 
fusion protein which contains the human KIM sequences 263-1 147 (SEQ ID NO:6; SJR 103) or 

20 rat KIM sequences 599-1319 (SEQ ID NO:l ; SJR 104) of the extracellular domain fused to 
human Ig Fc region. The cells were grown in 10% FBS in DMEM in the cell factory (Nunc, 
Naperville, Il).Two to three days post-transfection, medium was harvested, concentrated using 
Amicon concentrator, and fusion protein was purified using Protein-A Sepharose column. After 
purification, purity of fusion protein was evaluated by SDS-PAGE. 

25 Diagnostic Uses of the Compounds of the Invention 

Anti-KIM antibodies of the invention, which specifically bind to the protein of SEQ ID 
NO:3, SEQ ID NO:5 or SEQ ID NO:7 or a fragment thereof, are useful in several diagnostic 
methods. These agents may be labeled with detectable markers, such as fluoroscopically or 
radiographically opaque substances, and administered to a subject to allow imaging of tissues 
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which express KIM protein. The agents may also be bound to substances, such as horseradish 
peroxidase, which can be used as immunocytochemical stains to allow visualization of areas of 
KIM protein-positive cells on histological sections. A specific antibody could be used alone in 
this manner, and sites where it is bound can be visualized in a sandwich assay using an anti- 
5 immunoglobulin antibody which is itself bound to a detectable marker. 

Specific antibodies to KIM protein are also useful in immunoassays to measure KIM 
presence or concentration in samples of body tissues and fluids. Such concentrations may be 
correlated with different disease states. As an embodiment of particular interest, the invention 
includes a method of diagnosing renal injury, or of monitoring a process of renal repair, by 

1 0 measuring the concentration of KIM or of KIM fragments in the urine, plasma or serum of a 
patient. Similarly, KIM can be measured in urine sediment, in particular in cellular debris in the 
urine sediment. Casts of renal tubule cells, which may be present in urine sediment from patients 
with ongoing renal disease, may contain elevated levels of KIM protein and mRNA. 

Specific antibodies to KIM protein may also be bound to solid supports, such as beads or 

1 5 dishes, and used to remove the ligand from a solution, either for measurement, or for purification 
and characterization of the protein or its attributes (such as postradiational modifications). 
Such characterization of a patient's KIM protein might be useful in identifying deleterious 
mutants or processing defects which interfere with KIM function and are associated with 
abnormal patient phenotypes. Each of these techniques is routine to those of skill in the 

20 immunological arts. 

Additional imaging methods utilize KIM or KIM fragments, fused to imageable moieties, 
for diagnostic imaging of tissues that express KIM ligands, particularly tumors. 

Further diagnostic techniques are based on demonstration of upregulated KIM mRNA in 
tissues, as an indication of injury-related processes. This technique has been tested and found 

25 workable in a model of ischemic injury in rats, as follows. 

To determine if the amount of KIM- 1 protein is increased after injury, we examined 
kidney homogenates of contralateral and postishemic kidneys 24 and 48 hours following a 40 
minute clamping of the renal artery and vein of a single kidney for each rat The kidney 
homogenate was assessed for the presence of KIM-1 protein. Western blot analysis identifies 

30 three proteins detected by two different antibodies after ischemic injury, which are not detectable 
in homogenates from contralateral kidneys which were not exposed to ischemic injury. The 
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apparent molecular weights of the bands are approximately 40kDa, SOkDa and 70-80kDa. The 
three protein species detected by western blotting could represent glycosylated forms of the same 
protein given the presence of potential N and O linked glycosylation sites. The fact that each of 
these proteins react with two different sets of polyclonal antibodies supports the idea that they 

S are related to KIM-1 and are not cross-reacting bands. Confirmation of this prediction came 
from the results of partial CNBr cleavage of the three proteins which revealed they shared 
common CNBr cleavage fragments. Since the cytoplasmic domain of the KIM-1 protein is not 
predicted to contain any major post-translational modifications, the two smallest products of the 
digest (4.7kDa and 7.4kDa) detected with antibodies directed against the cytoplasmic domain of 

1 0 KIM-1 should be the same size for the three different KIM- 1 protein bands if they originate from 
the same protein. We observed that the KIM1 40kDa and 70-80kDa proteins yield fragments 
migrating at the predicted size. Digest of the SOkDa protein band gave also the same C -terminal 
signature band peptide. 

The KIM-1 sequence presents two putative sites for N-glycosylation and a mucin domain 

15 where O-glycosylation could cover the polypeptide chain. The three KIM-1 bands detected in 
postischemic kidney could correspond to glycosylation variants of the same core protein. De-N- 
glycosylation with PNGase F resulted in a shift of all three bands to a lower molecular weight, 
corresponding to a loss of about 3kDa, indicating that all three proteins are N-glycosylated. 
Differences in O-glycosylation might explain the differences in sizes of these three bands. 

20 Therapeutic Uses of the Compounds of the Invention 

The therapeutic methods of the invention involve selectively promoting or inhibiting 
cellular responses that are dependent on KIM ligation. Where the KIM and the KIM ligand are 
both membrane bound, and expressed by different cells, the signal transduction may occur in the 
KIM-expressing cell, in the KIM ligand-expressing cell, or in both. 
25 KIM ligation-triggered response in a KIM ligand-expressing cell may be generated by 

contacting the cell with exogenous KIM, KIM fusion proteins or activating antibodies against 
KIM ligand, either in vitro or in vivo. Further, responses of the KIM ligand-expressing cell that 
would otherwise be triggered by endogenous KIM could be blocked by contacting the KIM 
ligand-expressing cell with a KIM ligand antagonist (e.g., an antagonist antibody that binds to 
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KIM ligand), or by contacting the endogenous KIM with an anti-KIM antibody or other KIM- 
binding molecule which prevents the effective ligation of KIM with a KIM ligand. 

Similarly, the responses triggered by KIM ligation in the KIM-expressing cell may be 
promoted or inhibited with exogenous compounds. For example, KIM ligation-triggered 
5 response in a KIM-expressing cell may be generated by contacting the cell with a soluble KIM 
ligand, or certain anti-KIM activating antibodies. Further, responses of the KIM-expressing cell 
that would otherwise be triggered by interaction with endogenous KIM ligand could be blocked 
by contacting the KIM-expressing cell with an antagonist to KIM (e.g.., a blocking antibody that 
binds to KIM in a manner that prevents effective, signal-generating KIM ligation), or by 

10 contacting the endogenous KIM ligand with an anti-KIM ligand antibody or other KIM ligand- 
binding molecule which prevents the effective ligation of KIM with the KIM ligand. 

Which of the interventions described above are useful for particular therapeutic uses 
depend on the relevant etiologic mechanism of either the pathologic process to be inhibited, or of 
the medically desirable process to be promoted, as is apparent to those of skill in the medical 

1 5 arts. For example, where KIM ligation results in desirable cellular growth, maintenance of 
differentiated phenotype, resistance to apoptosis induced by various insults, or other medically 
advantageous responses, one of the above-described interventions that promote ligation-triggered 
response may be employed. In the alternative, one of the inhibitory interventions may be useful 
where KIM ligation invokes undesirable consequences, such as neoplastic growth, deleterious 

20 loss of cellular function, susceptibility to apoptosis, or promotion of inflammation events. 

Following are examples of the previously described therapeutic methods of the invention. 
One therapeutic use of the KIM-related compounds of the invention is for treating a subject with 
renal disease, promoting growth of new tissue in a subject, or promoting survival of damaged 
tissue in a subject, and includes the step of administering to the subject a therapeutically 

25 effective amount of a KIM protein of the invention, or of a pharmaceutical composition which 
includes a protein of the invention. The protein used in these methods may be a fragment of a 
full-length KIM protein, a soluble KIM ligand protein or fusion fragment, or a KIM agonist. 
These methods may also be practiced by administering to the subject a therapeutically effective 
amount of an agonist antibody of the invention, or a pharmaceutical composition which includes 

30 an agonist antibody of the invention. A KIM protein may be administered concurrently with a 
therapeutically effective amount of a second compound which exerts a medically desirable 
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adjunct effect. While tissues of interest for these methods may include any tissue, preferred 
tissues include renal tissue, liver, neural tissue, heart, stomach, small intestine, spinal cord, or 
lung. Particular renal conditions which may be beneficially treated with the compounds of the 
invention include acute renal failure, acute nephritis, chronic renal failure, nephrotic syndrome, 
5 renal tubule defects, kidney transplants, toxic injury, hypoxic injury, and trauma. Renal tubule 
defects include those of either hereditary or acquired nature, such as polycystic renal disease, 
medullary cystic disease, and medullary sponge kidney. This list is not limited, and may include 
many other renal disorders (see, e.g., Harrison's Principles of Internal Medicine, 13th ed., 1994, 
which is herein incorporated by reference.) The subject of the methods may be human. 

10 A therapeutic intervention for inhibiting growth of undesirable, KIM ligand-expressing 

tissue in a subject includes the step of administering to the subject a therapeutically effective 
amount of a KIM antagonist (e.g.., an antagonist antibody that binds to KIM ligand), or by 
administering a therapeutically effective amount of an anti-KIM antibody or other KIM-binding 
molecule which blocks KIM binding to the KIM ligand-expressing tissue. In an embodiment of 

IS interest, the KIM antagonist or anti-KIM antibody may be used therapeutically to inhibit or block 
growth of tumors which depend on KIM protein for growth. 

Other methods of the invention include killing KIM ligand-expressing tumor cells, or 
inhibiting their growth, by contacting the cells with a fusion protein of a KIM and a toxin or 
radionuclide, or an anti-KIM ligand antibody conjugated to a toxin or radionuclide. The cell may 

20 be within a subject, and the protein or the conjugated antibody is administered to the subject. 

Also encompassed within the invention is a method for targeting a toxin or radionuclide 
to a cell expressing a KIM, comprising contacting the cell with a fusion protein comprising a 
KIM ligand and a toxin or radionuclide, or an anti-KIM antibody conjugated to a toxin or 
radionuclide. Another embodiment includes the method of suppressing growth of a tumor cell 

25 which expresses KIM, comprising contacting the cell with a fusion protein of KIM ligand and a 
toxin or radionuclide or with an anti-KIM antibody conjugated to a toxin or radionuclide; the cell 
may be within a subject, and the protein administered to the subject. 

The term "subject" used herein is taken to mean any mammal to which KIM may be 
administered. Subjects specifically intended for treatment with the method of the invention 

30 include humans, as well as nonhuman primates, sheep, horses, cattle, goats, pigs, dogs, cats, 
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rabbits, guinea pigs, hamsters, gerbils, rats and mice, as well as the organs, tumors, and cells 
derived or originating from these hosts. 

Use Of Compounds of the Invention i n Gene Therapy 

The KIM genes of the invention are introduced into damaged tissue, or into tissue where 
stimulated growth is desirable. Such gene therapy stimulates production of KIM protein by the 
transfected cells, promoting cell growth and/or survival of cells that express the KIM protein. 

In a specific embodiment of a gene therapy method, a gene coding for a KIM protein may 
be introduced into a renal target tissue. The KIM protein would be stably expressed and 
stimulate tissue growth, division, or differentiation, or could potentiate cell survival. 
Furthermore, a KIM gene may be introduced into a target cell using a variety of well-known 
methods that use either viral or non-viral based strategies. 

Non-viral methods include electroporation, membrane fusion with liposomes, high 
velocity bombardment with DNA-coated microprojectiles, incubation with calcium-phosphate- 
DNA precipitate, DEAE-dextran mediated transfection, and direct micro-injection into single 
cells. For instance, a KIM gene may be introduced into a cell by calcium phosphate 
coprecipitation (Pillicer et al., Science, 209: 1414-1422 (1980); mechanical microinjection and/or 
particle acceleration (Anderson et al., Proc. Nat Acad. Sci. USA, 77: 5399-5403 0980); 
liposome based DNA transfer (e.g., LIPOFECTIN-mediated transfection- Fefgner et al., Proc. 
Nat. Acad. Sci., USA, 84: 471-477, 1987; Gao and Huang, Biochim. Biophys. Res. Comm., 179: 
280-285, 1991; DEAE Dextran-mediated transfection; electroporation (U.S. Patent 4,956,288); or 
polylysine-based methods in which DNA is conjugated to deliver DNA preferentially to liver 
hepatocytes (Wolff et al., Science, 247: 465-468, 1990; Curiel et al., Human Gene Therapy 3: 
' 147-154,1992). 

Target cells may be transfected with the genes of the invention by direct gene transfer. 
See, e.g., Wolff et al., "Direct Gene Transfer Into Moose Muscle In Vivo", Science 247:1465-68, 
1990. In many cases, vector-mediated transfection will be desirable. Any of the methods known 
in the art for the insertion of polynucleotide sequences into a vector may be used. ( See, for 
example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1989; and Ausubel et al., Current Protocols in Molecular 
Biology* J. Wiley & Sons, NY, 1992, both of which are incorporated herein by reference.) 
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Promoter activation may be tissue specific or inducible by a metabolic product or administered 
substance. Such promoters/enhancers include, but are not limited to, the native c-ret ligand 
protein promoter, the cytomegalovirus immediate-early promoter/enhancer (Karasuyama et al., J. 
Exp. Med., 169: 13, 1989); the human beta-actin promoter (Gunning et al., Proc. Nat. Acad. Sci. 
5 USA, 84: 483 1 , 1987; the glucocorticoid-inducible promoter present in the mouse mammary 
tumor virus long terminal repeat (MMTV LTR) (Klessig et al., Mol. Cell. Biol., 4: 1354, 1984); 
the long terminal repeat sequences of Moloney murine leukemia virus (MuLV LTR) (Weiss et 
al., RNA Tumor Viruses, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1985); the 
SV40 early region promoter (Bernoist and Chambon, Nature, 290:304, 1981); the promoter of the 

10 Rous sarcoma virus (RSV) (Yamamoto et al., Cell, 22:787, 1980); the herpes simplex virus 
(HSV) thymidine kinase promoter (Wagner et al., Proc. Nat. Acad. Sci. USA, 78: 1441 , 1981); 
the adenovirus promoter (Yamada et al., Proc. Nat Acad. Sci. USA, 82: 3567, 1985). 

The KIM genes may also be introduced by specific viral vectors for use in gene transfer 
systems which are now well established. See for example: Madzak et al., J. Gen . Virol., 73: 

1 5 1 533-36, 1 992 (papovavirus SV40); Berkner et al., Curr. Top. Microbiol. Immunol., 1 58: 39-6 1 , 
1992 (adenovirus); Hofinann et al., Proc. Natl. Acad. Sci. 92: 10099-10103, 1995 (baculovirus); 
Moss et al., Curr. Top. Microbiol. Immunol., 158: 25-38, 1992 (vaccinia virus); Muzyczka, Curr. 
Top. Microbiol. Immunol, 158: 97-123, 1992 ( adeno-associated virus); Margulskee, Curr. Top. 
Microbiol. Immunol., 158: 67-93, 1992 (herpes simplex virus (HSV) and Epstein-Barr virus 

20 (HBV)); Miller, Curr. Top. Microbiol. Immunol., 158: 1-24, 1992 (retrovirus); Brandyopadhyay 
et al.,Mol. Ceil. Biol., 4: 749-754, 1984 (retrovirus); Miller et al., Nature, 357: 455-450, 1992 
(retrovirus); Anderson, Science, 256: 808-813, 1992 (retrovirus), Current Protocols in Molecular 
Biology: Sections 9.10-9.14 (Ausubel et al., Eds.), Greene Publishing Associates, 1989, all of 
which are incorporated herein by reference. 

25 Preferred vectors are DNA viruses that include adenoviruses (preferably Ad-2 or Ad-5 

based vectors), baculovirus, herpes viruses (preferably herpes simplex virus based vectors), and 
parvoviruses (preferably "defective" or non-autonomous parvovirus based vectors, more 
preferably adeno-associated virus based vectors, most preferably AAV-2 based vectors). See, 
e.g., Ali et al., Gene Therapy 1: 367-384, 1994; U.S. Patent 4,797,368 and 5,399,346 and 

30 discussion below. 
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The choice of a particular vector system for transferring, for instance, a KIM sequence will 
depend on a variety of factors. One important factor is the nature of the target cell population. 
Although retroviral vectors have been extensively studied and used in a number of gene therapy 
applications, they are generally unsuited for infecting cells that are not dividing but may be 
useful in cancer therapy since they only integrate and express their genes in replicating cells. 
They are useful for vivo approaches and are attractive in this regard due to their stable 
integration into the target cell genome. 

Adenoviruses are eukaryotic DNA viruses that can be modified to efficiently deliver a 
therapeutic or reporter transgene to a variety of cell types. The general adenoviruses types 2 and 
5 (Ad2 and Ad5> respectively), which cause respiratory disease in humans, are cuiTently being 
developed for gene therapy of Duchenne Muscular Dystrophy (DMD)and Cystic Fibrosis (CF). 
Both Ad2 and Ad5 belong to a subclass of adenovirus that are not associated with human 
malignancies. Adenovirus vectors are capable of providing extremely high levels of transgene 
delivery to virtually all cell types, regardless of the mitotic state. High titers (10 13 plaque 
forming units/ml) of recombinant virus can be easily generated in 293 cells (an adenovirus- 
transformed, complementation human embryonic kidney cell line: ATCC CRL1573) and cryo- 
stored for extended periods without appreciable losses. The efficacy of this system in delivering 
a therapeutic transgene in vivo that complements a genetic imbalance has been demonstrated in 
animal models of various disorders. See Watanabe, Atherosclerosis, 36: 261-268, 1986; 
Tanzawa et al., FEBS Letters 1 18(l):81-84, 1980; Golasten et al., New EngU. Med. 309:288- 
296, 1983; Ishibashi et al., J. Clin. Invest. 92: 883-893, 1993; and Ishibashi et al., J. Clin. Invest 
93: 1889-1893, 1994, all of which are incorporated herein by reference. Indeed, recombinant 
replication defective adenovirus encoding a cDNA for the cystic fibrosis transmembrane 
regulator (CFTR) has been approved for use in at least two human CF clinical trials. See, e.g., 
Wilson, Nature 365:691-692 , 1993. Further support of the safety of recombinant adenoviruses 
for gene therapy is the extensive experience of live adenovirus vaccines in human populations. 

The first-generation recombinant, replication-deficient adenoviruses which have been 
developed for gene therapy of DMD and other inherited disorders contain deletions of the entire 
Ela and part of the Elb regions. This replication-defective virus is grown in 293 cells containing 
a functional adenovirus Ela gene which provides a transacting Ela protein. El-deleted viruses 
are capable of replicating and producing infectious virus in the 293 cells, which provide Ela and 
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Elb region gene products in trans. The resulting virus is capable of infecting many cell types 
and can express the introduced gene (providing it carries its own promoter), but cannot replicate 
in a cell that does not carry the El region DNA unless the cell is infected at a very high 
multiplicity of infection. Adenoviruses have the advantage that they have a broad host range, can 

S infect quiescent or terminally differentiated cells such as neurons, and appear essentially non- 
oncogenic. Adenoviruses do not appear to integrate into the host genome. Because they exist 
extrachromasornally, the risk of insertional mutagenesis is greatly reduced. AH et aL, supra, at 
373, Recombinant adenoviruses (rAdV) produce very high titers, the viral particles are 
moderately stable, expression levels are high, and a wide range of cells can be infected. Their 

1 0 natural host cells are airway epithelium, so they are useful for therapy of lung cancers. 

Baculovirus-mediated transfer has several advantages. Baculoviral gene transfer can occur 
in replicating and nonreplicating cells, and can occur in renal cells, as well as in hepatocytes, 
neural cells, spleen, skin, and muscle. Baculovirus is non-replicating and nonpathogenic in 
mammalian cells. Humans lack pre-existing antibodies to recombinant baculovirus which could 

1 S block infection. In addition, baculovirus is capable of incorporating and transducing very large 
DNA inserts. 

Adeno-associated viruses (AAV) have also been employed as vectors for somatic gene 
therapy. AAV is a small, single-stranded (ss) DNA virus with a simple genomic organization (4- 
7 kb) that makes it an ideal substrate for genetic engineering. Two open reading frames encode a 

20 series of rep and cap polypeptides. Rep polypeptides (rep78, rep68 9 rep 62 and rep 40) are 

involved in replication, rescue and integration of the AAV genome. The cap proteins (VP1 , VP2 
and VP3) form the virion capsid. Flanking the rep and cap open reading frames at the 5' and 3' 
ends are 145 bp inverted terminal repeats (ITRs), the first 125 bp of which are capable of 
forming Y- or T-shaped duplex structures. Of importance for the development of AAV vectors, 

25 the entire rep and cap domains can be excised and replaced with a therapeutic or repon r 

transgene. See B.J. Carter, in Handbook of Parvoviruses, ed., P. Tijsser, CRC Press, pp. i 55-168 
(1990). It has been shown that the ITRs represent the minimal sequence required for replication, 
rescue, packaging, and integration of the AAV genome. 

Adeno-associated viruses (AAV) have significant potential in gene therapy. The viral 

30 particles are very stable and recombinant AAVs (rAAV)have "drug-like" characteristic* in that 
rAAV can be purified by pelleting or by CsCl gradient banding. They are heat stable and can be 
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lyophilized to a powder and rehydrated to full activity. Their DNA stably integrates into host 
chromosomes so expression is long-term. Their host range is broad and AAV causes no known 
disease so that the recombinant vectors are non-toxic. 

Once introduced into a target cell, sequences of interest can be identified by conventional 
5 methods such as nucleic acid hybridization using probes comprising sequences that are 

homologous/complementary to the inserted gene sequences of the vector. In another approach, 
the sequence(s) may be identified by the presence or absence of a "marker" gene function (e.g, 
thymidine kinase activity, antibiotic resistance, and the like) caused by introduction of the 
expression vector into the target cell. 

10 Formulations and Administration 

The compounds of the invention are formulated according to standard practice, such as 
prepared in a carrier vehicle. The term "pharmacologically acceptable carrier" means one or 
more organic or inorganic ingredients, natural or synthetic, with which the mutant proto- 
oncogene or mutant oncoprotein is combined to facilitate its application. A suitable carrier 

1 5 includes sterile saline although other aqueous and non-aqueous isotonic sterile solutions and 

sterile suspensions known to be pharmaceutical^ acceptable are known to those of ordinary skill 
in the art. In this regard, the term "carrier" encompasses liposomes and the HIV-1 tat protein 
(See Chen et al., Anal. Biochem. 227: 168-175, 1995) as well as any plasmid and viral 
expression vectors. 

20 Any of the novel polypeptides of this invention may be used in the form of a 

pharmaceutical^ acceptable salt Suitable acids and bases which are capable of forming salts 
with the polypeptides of the present invention are well known to those of skill in the art, and 
include inorganic and organic acids and bases. 

A compound of the invention is administered to a subject in a therapeutically-effective 

25 amount, which means an amount of the compound which produces a medically desirable result 
or exerts an influence on the particular condition being treated. An effective amount of a 
compound of the invention is capable of ameliorating or delaying progression of the diseased, 
degenerative or damaged condition. The effective amount can be determined on an individual 
basis and will be based, in part, on consideration of the physical attributes of the subject, 
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symptoms to be treated and results sought. An effective amount can be determined by one of 
ordinary skill in the art employing such factors and using no more than routine experimentation. 

A liposome delivery system for a compound of the invention may be any of a variety of 
unilamellar vesicles, multilamellar vesicles, or stable plurilamellar vesicles, and may be prepared 
S and administered according to methods well known to those of skill in the art, for example in 
accordance with the teachings of United States Patent 5,169,637, 4,762,915, 5,000,958 or 
5,1 85,1 54, In addition, it may be desirable to express the novel polypeptides of this invention, as 
well as other selected polypeptides, as lipoproteins, in order to enhance their binding to 
liposomes. As an example, treatment of human acute renal failure with liposome-encapsulated 

1 0 KIM protein may be performed in vivo by introducing a KIM protein into cells in need of such 
treatment using liposomes. The liposomes can be delivered via catheter to the renal artery. The 
recombinant KIM protein is purified, for example, from CHO cells by immunoaffinity 
chromatography or any other convenient method, then mixed with liposomes and incorporated 
into them at high efficiency. The encapsulated protein may be tested in vitro for any effect on 

1 5 stimulating cell growth. 

The compounds of the invention may be administered in any manner which is medically 
acceptable. This may include injections, by parenteral routes such as intravenous, intravascular, 
intraarterial, subcutaneous, intramuscular, intratumor, intraperitoneal, intraventricular, 
intraepidural, or others as well as oral, nasal, ophthalmic, rectal, or topical. Sustained release 

20 administration is also specifically included in the invention, by such means as depot injections or 
erodible implants. Localized delivery is particularly contemplated, by such means as delivery 
via a catheter to one or more arteries, such as the renal artery or a vessel supplying a localized 
tumor. 

Although the foregoing invention has been described in some detail by way of illustration 
25 and example for purposes of clarity of understanding, it will be obvious to one skilled in the art 
that certain changes ami modifications may be practiced within the scope of the invention, as 
limited only by the scope of the appended claims. 
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SEQUENCE LISTING 



(X) GENERAL INFORMATION: 

(i) APPLICANT: Michele Sanicola-Nadel 
Joseph V. Bonventre 
Catherine A. Hess ion 
Takaharu Ichimura 
Henry Wei 
Richard L. Cate 

(ii) TITLE OF INVENTION: MODULATORS OF TISSUE REGENERATION 

(iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

<A) ADDRESSEE: Biogen, Inc. 

(B) STREET: 14 Cambridge Center 
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(E) COUNTRY: USA 

(F) ZIP: 02142 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patent In Release #1.0 , Version #1.30 
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(A) APPLICATION NUMBER : 

(B) FILING DATE: 23 -MAY- 1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 
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(viii) ATTORNEY/AGENT INFORMATION: 
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(ix) TELECOMMUNICATION INFORMATION: 
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(B) TELEFAX: (617) 679-2838 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 615.. 1535 



-32 < 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCGGCCGCGT CGACGGTGCC TGTGAGTAAA TAGATCAGGG TCTCCTTCAC AGCACATTCT 60 

CCAGGAAGCC GAGCAAACAT TAGTGCTATT TTACCCAGGA GGAAATCTAG GTGTAGAGAG 120 

CTCTACGGAT CTAAGGTTTG GATCTGTACC CAGTGCTTTT TTAGGTGTCT TTAGACATTT 180 

CTCAGGAAGA TGTAGTCTCT GTCACCATGT GTGGCTGAAT TCTAGCTCAG TCCATCTTAT 240 

TGTGTTTAAG GTAGTTGAAG TTTAGGAACC AACCAGTATG TCTCTGAGCA GAAGAGTACA 300 

GTGTCCATCT TGAGGACAAG CTCATCTTTA CCATTAGAGG GCTGGCCTTG GCTTAGATTC 360 

TACCGAGAAC ATACTCTCTA ATGGCTGCCC TCAGTTTTCT CTGTTTGCTG TCTTATTTGT 420 

GTCATGGCCA GAAGTCATAT GGATGGCTCT ATGTGAGCAA GGACCCAGAT AGAAGAGTGT 480 

ATTTGGGGGA ACAGGTTGCC CTAACAGAGA GTCCTGTGGG ATTCATGCAG TCAGGATGAA 540 

GACCTGATCA GACAGAGTGT GCTGAGTGCC ACGGCTAACC AGAGTGACTT GTCACTGTCC 600 

TTCAGGTCAA CACC ATG GTT CAA CTT CAA GTC TTC ATT TCA GGC CTC CTG 650 
Met Val Gin Leu Gin Val Phe He Ser Gly Leu Leu 
15 10 

CTG CTT CTT CCA GGC TCT GTA GAT TCT TAT GAA GTA GTG AAG GGG GTG 698 
Leu Leu Leu Pro Gly Ser Val Asp Ser Tyr Glu Val Val Lys Gly Val 
15 20 25 

GTG GGT CAC CCT GTC ACA ATT CCA TGT ACT TAG TCA ACA CGT GGA GGA 746 
Val Gly His Pro Val Thr lie Pro Cys Thr Tyr Ser Thr Arg Gly Gly 
30 35 40 

ATC ACA ACG ACA TGT TGG GGC CGG GGG CAA TGC CCA TAT TCT AGT TGT 794 
lie Thr Thr Thr Cys Trp Gly Arg Gly Gin Cys Pro Tyr Ser Ser Cys 
45 50 55 60 

CAA AAT ATA CTT ATT TGG ACC AAT GGA TAC CAA GTC ACC TAT CGG AGC 842 
Gin Asn lie Leu lie Trp Thr Asn Gly Tyr Gin Val Thr Tyr Arg Ser 
65 70 75 



AGC GGT CGA TAC AAC ATA AAG GGG CGT ATT TCA GAA GGA GAC GTA TCC 
Ser Gly Arg Tyr Asn lie Lys Gly Arg lie Ser Glu Gly Asp Val Ser 
80 85 90 



890 
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TTG ACA ATA GAG AAC TCT GTT GAT AGT GAT AGT GGT CTG TAT TGT TGC 938 
Leu Thr lie Glu Asn Ser Val Asp Ser Asp Ser Gly Leu Tyr Cys Cys 
95 100 105 

CGA GTG GAG ATT CCT GGA TGG TTC AAC GAT CAG AAA ATG ACC TTT TCA 986 
Arg Val Glu lie Pro Gly Trp Phe Asn Asp Gin Lys Met Thr Phe Ser 
110 us 120 

TTG GAA GTT AAA CCA GAA ATT CCC ACA AGT CCT CCA ACA AGA CCC ACA 1034 
Leu Glu Val Lys Pro Glu lie Pro Thr Ser Pro Pro Thr Arg Pro Thr 
125 130 135 140 

ACT ACA AGA CCC ACA ACC ACA AGG CCC ACA ACT ATT TCA ACA AGA TCC 1082 
Thr Thr Arg Pro Thr Thr Thr Arg Pro Thr Thr lie Ser Thr Arg Ser 
145 150 155 

ACA CAT GTA CCA ACA TCA ACC AGA GTC TCC ACC TCT ACT CCA ACA CCA 1130 
Thr His Val Pro Thr Ser Thr Arg Val Ser Thr Ser Thr Pro Thr Pro 
160 165 170 

GAA CAA ACA CAG ACT CAC AAA CCA GAA ATC ACT ACA TTT TAT GCC CAT 1178 
Glu Gin Thr Gin Thr His Lys Pro Glu He Thr Thr Phe Tyr Ala His 
175 180 185 

GAG ACA ACT GCT GAG GTG ACA GAA ACT CCA TCA TAT ACT CCT GCA GAC 1226 
Glu Thr Thr Ala Glu Val Thr Glu Thr Pro Ser Tyr Thr Pro Ala Asp 
190 195 200 

TGG AAT GGC ACT GTG ACA TCC TCA GAG GAG GCC TGG AAT AAT CAC ACT 1274 
Trp Asn Gly Thr Val Thr Ser Ser Glu Glu Ala Trp Asn Asn His Thr 
205 210 215 220 

GTA AGA ATC CCT TTG AGG AAG CCG CAG AGA AAC CCG ACT AAG GGC TTC 1322 
Val Arg He Pro Leu Arg Lys Pro Gin Arg Asn Pro Thr Lys Gly Phe 
225 230 235 

TAT GTT GGC ATG TCC GTT GCA GCC CTG CTG CTG CTG CTG CTT GCG AGC 1370 
Tyr Val Gly Met Ser Val Ala Ala Leu Leu Leu Leu Leu Leu Ala Ser 
240 * 245 250 

ACC GTG GTT GTC ACC AGG TAC ATC ATT ATA AGA AAG AAG ATG GGC TCT 1418 
Thr Val val Val Thr Arg Tyr He He He Arg Lys Lys Met Gly Ser 
255 260 265 

CTG AGC TTT GTT GCC TTC CAT GTC TCT AAG AGT AGA GCT TTG CAG AAC 1466 
Leu Ser Phe Val Ala Phe His Val Ser Lys Ser Arg Ala Leu Gin Asn 
270 275 280 

GCA GCG ATT GTG CAT CCC CGA GCT GAA GAC AAC ATC TAC ATT ATT GAA 1514 
Ala Ala He Val His Pro Arg Ala Glu Asp Asn He Tyr He lie Glu 
285 290 295 300 

GAT AGA TCT CGA GGT GCA GAA TGAGTCCCAG AGGCCTTCTG TGGGGCCTTC 1565 
Asp Arg Ser Arg Gly Ala Glu 
305 
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TGCCTGGGAT TACAGAGATC GTGACTGATT TCACAGAGTA AAATACCCAT TCCAGCTCCT 1625 

GGGAGATTTT GTGTTTTGGT TCTTCCAGCT GCAGTGGAGA GGGTAACCCT CTACCCTGTA 1685 

TATGCAAAAC TCGAGGTTAA CATCATCCTA ATTCTTGTAT CAGCAACACC TCAGTGTCTC 174 5 

CACTCACTGC AGCGATTCTC TCAAATGTGA ACATTTTAGA AGTTTGTGTT TCCTTTTGTC 1805 

CATGTAATCA TTGGTAATAC AAGAATTTTA TCTTGTTTAT TAAAACCATT AATGAGAGGG 1865 

GAATAGGAAT TAAAAGCTGG TGGGAAGGGC CTCCTGAATT TAGAAGCACT TCATGATTGT 1925 

GTTTATCTCT TTTATTGTAA TTTGAAATGT TACTTCTATC CTTCCCAAGG GGCAAAATCA 1985 

TGGGAGCATG GAGGTTTTAA TTGCCCTCAT AGATAAGTAG AAGAAGAGAG TCTAATGCCA 2045 

CCAATAGAGG TGGTTATGCT TTCTCACAGC TCTGGAAATA TGATCATTTA TTATGCAGTT 2105 

GATCTTAGGA TGAGGATGGG TTTCTTAGGA GGAGAGGTTA CCATGGTGAG TGGACCAGGC 216 5 

ACACATCAGG GGAAGAAAAC AATGGATCAA GGGATTGAGT TCATTAGAGC CATTTCCACT 222 5 

CCACTTCTGT CTTGATGCTC AGTGTTCCTA AACTCACCCA CTGAGCTCTG AATTAGGTGC 2285 

AGGGAGGAGA CGTGCAGAAA CGAAAGAGGA AAGAAAGGAG AGAGAGCAGG ACACAGGCTT 2345 

TCTGCTGAGA GAAGTCCTAT TGCAGGTGTG ACAGTGTTTG GGACTACCAC GGGTTTCCTT 2405 

CAGACTTCTA AGTTTCTAAA TCACTATCAT GTGATCATAT TTATTTTTAA AATTATTTCA 2465 

GAAAGACACC ACATTTTCAA TAATAAATCA GTTTGTCACA ATTAATAAAA TATTTTGTTT 2525 

GCTAAGAAGT AAAAAAAAAA AAAAAAAGTC GACGCGGCCG C 2566 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 145.. 1065 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GCGGCCGCGT CGACGGTGCC TGTGAGTAAA TAGATCAGGG TCTCCTTCAC AGCACATTCT 60 
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CCAGGAAGCC GAGCAAACAT TAGTGCTATT TTACCCAGGA GGAAATCTAG GTGTAGAGAG 120 

CTCTACGGAT CTAAGGTCAA CACC ATG GTT CAA CTT CAA GTC TTC ATT TCA 17 X 

Met Val Gin Leu Gin Val Phe lie Ser 
1 5 

GGC CTC CTG CTG CTT CTT CCA GGC TCT GTA GAT TCT TAT GAA GTA GTG 219 
Gly Leu Leu Leu Leu Leu Pro Gly Ser Val Asp Ser Tyr Glu Val Val 
10 15 20 25 

AAG GGG GTG GTG GGT CAC CCT GTC ACA ATT CCA TGT ACT TAC TCA ACA 267 
Lys Gly Val Val Gly His Pro Val Thr lie Pro Cys Thr Tyr Ser Thr 
30 35 40 

CGT GGA GGA ATC ACA ACG ACA TGT TGG GGC CGG GGG CAA TGC CCA TAT 315 
Arg Gly Gly lie Thr Thr Thr Cys Trp Gly Arg Gly Gin Cys Pro Tyr 
45 50 55 

TCT AGT TGT CAA AAT ATA CTT ATT TGG ACC AAT GGA TAC CAA GTC ACC 363 
Ser Ser Cys Gin Asn lie Leu lie Trp Thr Asn Gly Tyr Gin Val Thr 
60 65 70 

TAT CGG AGC AGC GGT CGA TAC AAC ATA AAG GGG CGT ATT TCA GAA GGA 411 
Tyr Arg Ser Ser Gly Arg Tyr Asn lie Lys Gly Arg lie Ser Glu Gly 
75 B0 85 

GAC GTA TCC TTG ACA ATA GAG AAC TCT GTT GAT AGT GAT AGT GGT CTG 459 
Asp Val Ser Leu Thr lie Glu Asn Ser Val Asp Ser Asp Ser Gly Leu 
90 95 100 105 

TAT TGT TGC CGA GTG GAG ATT CCT GGA TGG TTC AAC GAT CAG AAA ATG 507 
Tyr Cys Cys Arg Val Glu lie Pro Gly Trp Phe Asn Asp Gin Lys Met 
110 us 120 

ACC TTT TCA TTG GAA GTT AAA CCA GAA ATT CCC ACA AGT CCT CCA ACA 555 
Thr Phe Ser Leu Glu Val Lys Pro Glu He Pro Thr Ser Pro Pro Thr 
125 130 135 

AGA CCC ACA ACT ACA AGA CCC ACA ACC ACA AGG CCC ACA ACT ATT TCA 603 
Arg Pro Thr Thr Thr Arg Pro Thr Thr Thr Arg Pro Thr Thr He Ser 
140 145 150 

ACA AGA TCC ACA CAT GTA CCA ACA TCA ACC AGA GTC TCC ACC TCT ACT 651 
Thr Arg Ser Thr His Val Pro Thr Ser Thr Arg Val Ser Thr Ser Thr 
155 160 165 

CCA ACA CCA GAA CAA ACA CAG ACT CAC AAA CCA GAA ATC ACT ACA TTT 699 
Pro Thr Pro Glu Gin Thr Gin Thr His Lys Pro Glu He Thr Thr Phe 
"0 175 180 185 

TAT GCC CAT GAG ACA ACT GCT GAG GTG ACA GAA ACT CCA TCA TAT ACT 747 
Tyr Ala His Glu Thr Thr Ala Glu Val Thr Glu Thr Pro Ser Tyr Thr 
190 195 200 
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CCT GCA GAC TGG AAT GGC ACT GTG ACA TCC TCA GAG GAG GCC TGG AAT 795 
Pro Ala Asp Trp Asn Gly Thr Val Thr Ser Ser Glu Glu Ala Tip Asn 
205 210 215 



AAT CAC ACT GTA AGA ATC CCT TTG AGG AAG CCG CAG AGA AAC CCG ACT 843 
Asn His Thr Val Arg lie Pro Leu Arg Lys Pro Gin Arg Asn Pro Thr 
220 225 230 

AAG GGC TTC TAT GTT GGC ATG TCC GTT GCA GCC CTG CTG CTG CTG CTG 891 
Lys Gly Phe Tyr Val Gly Met Ser Val Ala Ala Leu Leu Leu Leu Leu 
235 240 245 



CTT GCG AGC ACC GTG GTT GTC ACC AGG TAC ATC ATT ATA AGA AAG AAG 939 
Leu Ala Ser Thr Val Val Val Thr Arg Tyr lie lie lie Arg Lys Lys 
250 255 260 265 

ATG GGC TCT CTG AGC TTT GTT GCC TTC CAT GTC TCT AAG AGT AGA GCT 987 
Met Gly Ser Leu Ser Phe Val Ala Phe His Val Ser Lys Ser Arg Ala 
270 275 280 

TTG CAG AAC GCA GCG ATT GTG CAT CCC CGA GCT GAA GAC AAC ATC TAC 1035 
Leu Gin Asn Ala Ala lie Val His Pro Arg Ala Glu Asp Asn lie Tyr 
285 290 295 

ATT ATT GAA GAT AGA TCT CGA GGT GCA GAA TGAGTCCCAG AGGCCTTCTG 1085 
lie lie Glu Asp Arg Ser Arg Gly Ala Glu 
300 305 

TGGGGCCTTC TGCCTGGGAT TACAGAGATC GTG AC TG ATT TCACAGAGTA AAATACCCAT 1145 

TCCAGCTCCT GGGAGATTTT GTGTTTTGGT TCTTCCAGCT GCAGTGGAGA GGGTAACCCT 1205 

CTACCCTGTA TATGCAAAAC TCGAGGTTAA CATCATCCTA ATTCTTGTAT CAGCAACACC 1265 

TCAGTGTCTC CACTCACTGC AGCGATTCTC TCAAATGTGA ACATTTTAGA AGTTTGTGTT 1325 

TCCTTTTGTC CATGTAATCA TTGGTAATAC AAGAATTTTA TCTTGTTTAT TAAAACCATT 1385 

AATGAGAGGG GAATAGGAAT TAAAAGCTGG TGGGAAGGGC CTCCTGAATT TAGAAGCACT 1445 

TCATGATTGT GTTTATCTCT TTTATTGTAA TTTGAAATGT TACTTCTATC CTTCCCAAGG 1505 

GGCAAAATCA TGGGAGCATG GAGGTTTTAA TTGCCCTCAT AGATAAGTAG AAGAAGAGAG 1565 

TCTAATGCCA CCAATAGAGG TGGTTATGCT TTCTCACAGC TCTGGAAATA TGATCATTTA 1625 

TTATGCAGTT GATCTTAGGA TGAGGATGGG TTTCTTAGGA GGAGAGGTTA CCATGGTGAG 1685 

TGGACCAGGC ACACATCAGG GGAAGAAAAC AATGGATCAA GGGATTGAGT TCATTAGAGC 1745 

CATTTCCACT CCACTTCTGT CTTGATGCTC AGTGTTCCTA AACTCACCCA CTGAGCTCTG 1805 

AATTAGGTGC AGGGAGGAGA CGTGCAGAAA CGAAAGAGGA AAGAAAGGAG AGAGAGCAGG 1865 

ACACAGGCTT TCTGCTGAGA GAAGTCCTAT TGCAGGTGTG ACAGTGTTTG GGACTACCAC 1925 



WO 97/44460 



PCT/US97/09303 



-37- 

GGGTTTCCTT CAGACTTCTA AGTTTCTAAA TCACTATCAT GTGATCATAT TTATTTTTAA 1985 

AATTATTTCA GAAAGACACC ACATTTTCAA TAATAAATCA GTTTGTCACA ATTAATAAAA 2045 

TATTTTGTTT GCTAAGAAGT AAAAAGTCGA CGCGGCCGC 2084 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Gin Leu Gin Val Phe lie Ser Gly Leu Leu Leu Leu Leu Pro 
15 10 is 

Gly Ser Val Asp Ser Tyr Glu Val Val Lys Gly Val Val Gly His Pro 
20 25 30 

Val Thr He Pro Cys Thr Tyr Ser Thr Arg Gly Gly He Thr Thr Thr 
35 40 45 

Cys Trp Gly Arg Gly Gin Cys Pro Tyr Ser Ser Cys Gin Asn He Leu 
50 55 60 

He Trp Thr Asn Gly Tyr Gin Val Thr Tyr Arg Ser Ser Gly Arg Tyr 
65 70 75 80 

Asn He Lys Gly Arg He Ser Glu Gly Asp Val Ser Leu Thr He Glu 
85 90 95 

Asn Ser Val Asp Ser Asp Ser Gly Leu Tyr Cys Cys Arg Val Glu He 
100 105 HO 

Pro Gly Trp Phe Asn Asp Gin Lys Met Thr Phe Ser Leu Glu Val Lys 
115 120 125 

Pro Glu He Pro Thr Ser Pro Pro Thr Arg Pro Thr Thr Thr Arg Pro 
130 135 140 

Thr Thr Thr Arg Pro Thr Thr He Ser Thr Arg Ser Thr His Val Pro 
!45 150 155 160 

Thr Ser Thr Arg Val Ser Thr Ser Thr Pro Thr Pro Glu Gin Thr Gin 
165 170 175 

Thr His Lys Pro Glu lie Thr Thr Phe Tyr Ala His Glu Thr Thr Ala 
180 185 190 
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Glu Val Thr Glu Thr Pro Ser Tyr Thr Pro Ala Asp Trp Asn Gly Thr 
195 200 205 

Val Thr Ser Ser Glu Glu Ala Trp Asn Asn His Thr Val Arg He Pro 
2X0 215 220 

Leu Arg Lys Pro Gin Arg Asn Pro Thr Lys Gly Phe Tyr Val Gly Met 
225 230 235 240 

Ser Val Ala Ala Leu Leu Leu Leu Leu Leu Ala Ser Thr Val Val Val 
245 250 255 

Thr Arg Tyr He He He Arg Lys Lys Met Gly Ser Leu Ser Phe Val 
260 265 270 

Ala Phe His Val Ser Lys Ser Arg Ala Leu Gin Asn Ala Ala He Val 
275 2B0 285 

His Pro Arg Ala Glu Asp Asn He Tyr He He Glu Asp Arg Ser Arg 
290 295 300 

Gly Ala Glu 
305 

(2) INFORMATION FOR SEQ ID NO:4: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 107.. 1822 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCGGCCGCGT CGACTCGCAG GAGGCCGGCA CTCTGACTCC TGGTGGATGG GACTAGGGAG 60 

TCAGAGTCAA GCCCTGACTG GCTGAGGGCG GGCGCTCCGA GTCAGC ATG GAA AGT 115 

Met Glu Ser 
1 

CTC TGC GGG GTC CTG GTA TTT CTG CTG CTG GCT GCA GGA CTG CCG CTC 163 
Leu Cys Gly Val Leu Val Phe Leu Leu Leu Ala Ala Gly Leu Pro Leu 
5 10 15 



CAG GCG GCC AAG CGG TTC CGT GAT GTG CTG GGC CAT GAG CAG TAT CCG 211 
Gin Ala Ala Lys Arg Phe Arg Asp Val Leu Gly His Glu Gin Tyr Pro 
20 25 30 35 
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GAT CAC ATG AGG GAG AAC AAC CAA TTA CGT GGC TGG TCT TCA GAT GAA 259 
Asp His Met Arg Glu Asn Asn Gin Leu Arg Gly Trp Ser Ser Asp Glu 
40 45 50 

AAT GAA TGG GAT GAA CAG CTG TAT CCA GTG TGG AGG AGG GGA GAG GGC 307 
Asn Glu Trp Asp Glu Gin Leu Tyr Pro Val Trp Arg Arg Gly Glu Gly 
55 60 65 

AGA TGG AAG GAC TCC TGG GAA GGA GGC CGT GTG CAG GCA GCC CTA ACC 355 
Arg Trp Lys Asp Ser Trp Glu Gly Gly Arg Val Gin Ala Ala Leu Thr 
70 75 80 

AGT GAT TCA CCG GCC TTG GTG GGT TCC AAT ATC ACC TTC GTA GTG AAC 403 
Ser Asp Ser Pro Ala Leu Val Gly Ser Asn He Thr Phe Val Val Asn 
85 90 95 

CTG GTG TTC CCC AGA TGC CAG AAG GAA GAT GCC AAC GGC AAT ATC GTC 451 
Leu Val Phe Pro Arg Cys Gin Lys Glu Asp Ala Asn Gly Asn He Val 
100 105 no us 

TAT GAG AGG AAC TGC AGA AGT GAT TTG GAG CTG GCT TCT GAC CCG TAT 499 
Tyr Glu Arg Asn Cys Arg Ser Asp Leu Glu Leu Ala Ser Asp Pro Tyr 
120 125 130 

GTC TAC AAC TGG ACC ACA GGG GCA GAC GAT GAG GAC TGG GAA GAC AGC 547 
Val Tyr Asn Trp Thr Thr Gly Ala Asp Asp Glu Asp Trp Glu Asp Ser 
135 140 145 

ACC AGC CAA GGC CAG CAC CTC AGG TTC CCC GAC GGG AAG CCC TTC CCT 595 
Thr Ser Gin Gly Gin His Leu Arg Phe Pro Asp Gly Lys Pro Phe Pro 
150 155 160 

CGC CCC CAC GGA CGG AAG AAA TGG AAC TTC GTC TAC GTC TTC CAC ACA 643 
Arg Pro His Gly Arg Lys Lys Trp Asn Phe Val Tyr Val Phe His Thr 
165 170 175 

CTT GGT CAG TAT TTT CAA AAG CTG GGT CGG TGT TCA GCA CGA GTT TCT 691 
Leu Gly Gin Tyr Phe Gin Lys Leu Gly Arg Cys Ser Ala Arg Val Ser 
"0 185 190 195 

ATA AAC ACA GTC AAC TTG ACA GTT GGC CCT CAG GTC ATG GAA GTG ATT 739 
He Asn Thr Val Asn Leu Thr Val Gly Pro Gin Val Met Glu Val He 
200 205 210 

GTC TTT CGA AGA CAC GGC CGG GCA TAC ATT CCC ATC TCC AAA GTG AAA 787 
Val Phe Arg Arg His Gly Arg Ala Tyr He Pro He Ser Lys Val Lys 
215 220 225 

GAC GTG TAT GTG ATA ACA GAT CAG ATC CCT ATA TTC GTG ACC ATG TAC 835 
Asp Val Tyr Val He Thr Asp Gin He Pro He Phe Val Thr Met Tyr 
230 235 240 

CAG AAG AAT GAC CGG AAC TCG TCT GAT GAA ACC TTC CTC AGA GAC CTC 883 
Gin Lys Asn Asp Arg Asn Ser Ser Asp Glu Thr Phe Leu Arg Asp Leu 
245 250 255 
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CCC ATT TTC TTC GAT GTC CTC ATT CAC GAT CCC AGT CAT TTC CTC AAC 931 

Pro lie Phe Phe Asp Val Leu lie His Asp Pro Ser His Phe Leu Asn 
260 265 270 275 

TAG TCT GCC ATT TCC TAC AAG TGG AAC TTT GGG GAC AAC ACT GGC CTG 979 
Tyr Ser Ala He Ser Tyr Lys Trp Asn Phe Gly Asp Asn Thr Gly Leu 
2B0 285 290 

TTT GTC TCC AAC AAT CAC ACT TTG AAT CAC ACG TAT GTG CTC AAT GGA 1027 
Phe Val Ser Asn Asn His Thr Leu Asn His Thr Tyr Val Leu Asn Gly 
295 300 305 

ACC TTC AAC TTT AAC CTC ACC GTG CAA ACT GCA GTG CCG GGA CCA TGC 1075 
Thr Phe Asn Phe Asn Leu Thr Val Gin Thr Ala Val Pro Gly Pro Cys 
310 315 320 

CCC TCA CCC ACA CCT TCG CCT TCT TCT TCG ACT TCT CCT TCG CCT GCA 1123 
Pro Ser Pro Thr Pro Ser Pro Ser Ser Ser Thr Ser Pro Ser Pro Ala 
325 330 335 

TCT TCG CCT TCA CCC ACA TTA TCA ACA CCT AGT CCC TCT TTA ATG CCT 1171 
Ser Ser Pro Ser Pro Thr Leu Ser Thr Pro Ser Pro Ser Leu Met Pro 
340 345 350 355 

ACT GGC CAC AAA TCC ATG GAG CTG AGT GAC ATT TCC AAT GAA AAC TGC 1219 
Thr Gly His Lys Ser Met Glu Leu Ser Asp He Ser Asn Glu Asn Cys 
360 365 370 

CGA ATA AAC AGA TAT GGT TAC TTC AGA GCC ACC ATC ACA ATT GTA GAT 1267 
Arg He Asn Arg Tyr Gly Tyr Phe Arg Ala Thr He Thr lie Val Asp 
375 3B0 365 

GGA ATC CTA GAA GTC AAC ATC ATC CAG GTA GCA GAT GTC CCA ATC CCC 1315 
Gly He Leu Glu Val Asn He He Gin Val Ala Asp Val Pro He Pro 
390 395 400 

ACA CCG CAG CCT GAC AAC TCA CTG ATG GAC TTC ATT GTG ACC TGC AAA 1363 
Thr Pro Gin Pro Asp Asn Ser Leu Met Asp Phe He Val Thr Cys Lys 
405 410 415 

GGG GCC ACT CCC ACG GAA GCC TGT ACG ATC ATC TCT GAC CCC ACC TGC 1411 
Gly Ala Thr Pro Thr Glu Ala Cys Thr He He Ser Asp Pro Thr Cys 
420 425 430 435 

CAG ATC GCC CAG AAC AGG GTG TGC AGC CCG GTG GCT GTG GAT GAG CTG 1459 
Gin He Ala Gin Asn Arg Val Cys Ser Pro Val Ala Val Asp Glu Leu 
440 445 450 

TGC CTC CTG TCC GTG AGG AGA GCC TTC AAT GGG TCC GGC ACG TAC TGT 1507 
Cys Leu Leu Ser Val Arg Arg Ala Phe Asn Gly Ser Gly Thr Tyr Cys 
455 460 465 

GTG AAT TTC ACT CTG GGA GAC GAT GCA AGC CTG GCC CTC ACC AGC GCC 1555 
Val Asn Phe Thr Leu Gly Asp Asp Ala Ser Leu Ala Leu Thr Ser Ala 
470 475 480 
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CTG ATC TCT ATC CCT GGC AAA GAC CTA GGC TCC CCT CTG AGA ACA GTG 1603 
Leu lie Ser lie Pro Gly Lys Asp Leu Gly Ser Pro Leu Arg Thr Val 
485 490 495 

AAT GGT GTC CTG ATC TCC ATT GGC TGC CTG GCC ATG TTT GTC ACC ATG 1651 
Asn Gly Val Leu He Ser He Gly Cys Leu Ala Met Phe Val Thr Met 
500 505 510 515 

GTT ACC ATC TTG CTG TAC AAA AAA CAC AAG ACG TAC AAG CCA ATA GGA 1699 
Val Thr He Leu Leu Tyr Lys Lys His Lys Thr Tyr Lys Pro He Gly 
520 525 530 

AAC TGC ACC AGG AAC GTG GTC AAG GGC AAA GGC CTG AGT GTT TTT CTC 1747 
Asn Cys Thr Arg Asn Val Val Lys Gly Lys Gly Leu Ser Val Phe Leu 
535 540 545 

AGC CAT GCA AAA GCC CCG TTC TCC CGA GGA GAC CGG GAG AAG GAT CCA 1795 
Ser His Ala Lys Ala Pro Phe Ser Arg Gly Asp Arg Glu Lys Asp Pro 
550 555 560 

CTG CTC CAG GAC AAG CCA TGG ATG CTC TAAGTCTTCA CTCTCACTTC 1842 
Leu Leu Gin Asp Lys Pro Trp Met Leu 
565 570 

TGACTGGGAA CCCACTCTTC TGTGCATGTA TGTGAGCTGT GCAGAAGTAC ATGACTGGTA 1902 

GCTGTTGTTT TCTACGGATT ATTGTAAAAT GTATATCATG GTTTAGGGAG CGTAGTTAAT 1962 

TGGCATTTTA GTGAAGGGAT GGGAAGACAG TATTTCTTCA CATCTGTATT GTGGTTTTTA 2022 

TACTGTTAAT AGGGTGGGCA CATTGTGTCT GAAGGGGGAG GGGGAGGTCA CTGCTACTTA 2082 

AGGTCCTAGG TTAACTGGGA GAGGATGCCC CAGGCTCCTT AGATTTCTAC ACAAGATGTG 2142 

CCTGAACCCA GCTAGTCCTG ACCTAAAGGC CATGCTTCAT CAACTCTATC TCAGCTCATT 2202 

GAACATACCT GAGCACCTGA TGGAATTATA ATGGAACCAA GCTTGTTGTA TGGTGTGTGT 2262 

GTGTACATAA GATACTCATT AAAAAGACAG TCTATTAAAA A 2303 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Glu Ser Leu Cys Gly Val Leu Val Phe Leu Leu Leu Ala Ala Gly 
1 5 10 15 
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Leu Pro Leu Gin Ala Ala Lys Arg Phe Arg Asp Val Leu Gly His Glu 
20 25 30 

Gin Tyr Pro Asp His Met Arg Glu Asn Asn Gin Leu Arg Gly Trp Ser 
35 40 45 

Ser Asp Glu Asn Glu Trp Asp Glu Gin Leu Tyr Pro Val Trp Arg Arg 
50 55 60 

Gly Glu Gly Arg Trp Lys Asp Ser Trp Glu Gly Gly Arg Val Gin Ala 
65 70 75 BO 

Ala Leu Thr Ser Asp Ser Pro Ala Leu Val Gly Ser Asn lie Thr Phe 
B5 90 95 

Val Val Asn Leu Val Phe Pro Arg Cys Gin Lys Glu Asp Ala Asn Gly 
100 105 110 

Asn lie Val Tyr Glu Arg Asn Cys Arg Ser Asp Leu Glu Leu Ala Ser 
115 120 125 

Asp Pro Tyr Val Tyr Asn Trp Thr Thr Gly Ala Asp Asp Glu Asp Trp 
130 135 140 

Glu Asp Ser Thr Ser Gin Gly Gin His Leu Arg Phe Pro Asp Gly Lys 
145 150 155 160 

Pro Phe Pro Arg Pro His Gly Arg Lys Lys Trp Asn Phe Val Tyr Val 
165 170 175 

Phe His Thr Leu Gly Gin Tyr Phe Gin Lys Leu Gly Arg Cys Ser Ala 
180 185 190 

Arg Val Ser He Asn Thr Val Asn Leu Thr Val Gly Pro Gin Val Met 
195 200 205 

Glu Val He Val Phe Arg Arg His Gly Arg Ala Tyr He Pro He Ser 
210 215 220 

Lys Val Lys Asp Val Tyr Val He Thr Asp Gin He Pro He Phe Val 
225 230 235 240 

Thr Met Tyr Gin Lys Asn Asp Arg Asn Ser Ser Asp Glu Thr Phe Leu 
245 250 255 

Arg Asp Leu Pro He Phe Phe Asp Val Leu He His Asp Pro Ser His 
260 265 270 

Phe Leu Asn Tyr Ser Ala He Ser Tyr Lys Trp Asn Phe Gly Asp Asn 
275 280 2B5 

Thr Gly Leu Phe Val Ser Asn Asn His Thr Leu Asn His Thr Tyr Val 
290 295 300 
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Leu Asn Gly Thr Phe Asn Phe Asn Leu Thr Val Gin Thr Ala Val Pro 
305 310 315 320 

Gly Pro Cys Pro Ser Pro Thr Pro Ser Pro Ser Ser Ser Thr Ser Pro 
325 330 335 

Ser Pro Ala Ser Ser Pro Ser Pro Thr Leu Ser Thr Pro Ser Pro Ser 
340 345 350 

Leu Met Pro Thr Gly His Lys Ser Met Glu Leu Ser Asp He Ser Asn 
355 360 365 

Glu Asn Cys Arg He Asn Arg Tyr Gly Tyr Phe Arg Ala Thr He Thr 
370 375 360 

He Val Asp Gly He Leu Glu Val Asn lie He Gin Val Ala Asp Val 
385 390 395 400 

Pro He Pro Thr Pro Gin Pro Asp Asn Ser Leu Met Asp Phe He Val 
405 410 415 

Thr Cys Lys Gly Ala Thr Pro Thr Glu Ala Cys Thr He He Ser Asp 
420 425 430 

Pro Thr Cys Gin He Ala Gin Asn Arg Val Cys Ser Pro Val Ala Val 
435 440 445 

Asp Glu Leu Cys Leu Leu Ser Val Arg Arg Ala Phe Asn Gly Ser Gly 
450 455 460 

Thr Tyr Cys Val Asn Phe Thr Leu Gly Asp Asp Ala Ser Leu Ala Leu 
4 * 5 470 475 480 

Thr Ser Ala Leu He Ser. He Pro Gly Lys Asp Leu Gly Ser Pro Leu 
485 490 495 

Arg Thr Val Asn Gly Val Leu He Ser He Gly Cys Leu Ala Met Phe 
500 505 5io 

Val Thr Met Val Thr He Leu Leu Tyr Lys Lys His Lys Thr Tyr Lys 
515 520 525 

Pro He Gly Asn Cys Thr Arg Asn Val Val Lys Gly Lys Gly Leu Ser 
530 535 540 

Val Phe Leu Ser His Ala Lys Ala Pro Phe Ser Arg Gly Asp Arg Glu 
545 550 555 560 

Lys Asp Pro Leu Leu Gin Asp Lys Pro Trp Met Leu 
565 570 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1795 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 278.. 1279 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCGGCCGCGT CGACGAAGCT GGGAAGTCAG GGGCTGTTTC TGTGGGCAGC TTTCCCTGTC 60 

CTTTGGAAGG CACAGAGCTC TCAGCTGCAG GGAACTAACA GAGCTCTGAA GCCGTTATAT 120 

GTGGTCTTCT CTCATTTCCA GCAGAGCAGG CTCATATGAA TCAACCAACT GGGTGAAAAG 180 

ATAAGTTGCA ATCTGAGATT TAAGACTTGA TCAGATACCA TCTGGTGGAG GGTACCAACC 240 

AGCCTGTCTG CTCATTTTCC TTCAGGCTGA TCCCATA ATG CAT CCT CAA GTG GTC 295 

Met His Pro Gin Val Val 
1 5 

ATC TTA AGC CTC ATC CTA CAT CTG GCA GAT TCT GTA GCT GGT TCT GTA 343 
lie Leu Ser Leu lie Leu His Leu Ala Asp Ser Val Ala Gly Ser Val 
10 15 20 

AAG GTT GGT GGA GAG GCA GGT CCA TCT GTC ACA CTA CCC TGC CAC TAC 391 
Lys Val Gly Gly Glu Ala Gly Pro Ser Val Thr Leu Pro Cys His Tyr 
25 30 35 

AGT GGA GCT GTC ACA TCA ATG TGC TGG AAT AGA GGC TCA TGT TCT CTA 439 
Ser Gly Ala Val Thr Ser Met Cys Trp Asn Arg Gly Ser Cys Ser Leu 
40 45 50 

TTC ACA TGC CAA AAT GGC ATT GTC TGG ACC AAT GGA ACC CAC GTC ACC 487 
Phe Thr Cys Gin Asn Gly lie Val Trp Thr Asn Gly Thr His Val Thr 
55 60 65 70 

TAT CGG AAG GAC ACA CGC TAT AAG CTA TTG GGG GAC CTT TCA AGA AGG 535 
Tyr Arg Lys Asp Thr Arg Tyr Lys Leu Leu Gly Asp Leu Ser Arg Arg 
75 80 85 

GAT GTC TCT TTG ACC ATA GAA AAT ACA GCT GTG TCT GAC AGT GGC GTA 583 
Asp Val Ser Leu Thr lie Glu Asn Thr Ala Val Ser Asp Ser Gly Val 
90 95 100 

TAT TGT TGC CGT GTT GAG CAC CGT GGG TGG TTC AAT GAC ATG AAA ATC 631 
Tyr Cys Cys Arg Val Glu His Arg Gly Trp Phe Asn Asp Met Lys lie 
105 110 115 
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ACC GTA TCA TTG GAG ATT GTG CCA CCC AAG GTC ACG ACT ACT CCA ATT 679 
Thr Val Ser Leu Glu lie Val Pro Pro Lya Val Thr Thr Thr Pro lie 
120 125 130 

GTC ACA ACT GTT CCA ACC GTC ACG ACT GTT CGA ACG AGC ACC ACT GTT 727 
Val Thr Thr Val Pro Thr Val Thr Thr Val Arg Thr Ser Thr Thr Val 
135 140 145 150 

CCA ACG ACA ACG ACT GTT CCA ACG ACA ACT GTT CCA ACA ACA ATG AGC 775 
Pro Thr Thr Thr Thr Val Pro Thr Thr Thr Val Pro Thr Thr Met Ser 
155 160 165 

ATT CCA ACG ACA ACG ACT GTT CCG ACG ACA ATG ACT GTT TCA ACG ACA 823 
lie Pro Thr Thr Thr Thr Val Pro Thr Thr Met Thr Val Ser Thr Thr 
170 175 1B0 

ACG AGC GTT CCA ACG ACA ACG AGC ATT CCA ACA ACA ACA AGT GTT CCA 871 
Thr Ser Val Pro Thr Thr Thr Ser lie Pro Thr Thr Thr Ser Val Pro 
185 190 195 

GTG ACA ACA ACG GTC TCT ACC TTT GTT CCT CCA ATG CCT TTG CCC AGG 919 
Val Thr Thr Thr Val Ser Thr Phe Val Pro Pro Met Pro Leu Pro Arg 
200 205 210 

CAG AAC CAT GAA CCA GTA GCC ACT TCA CCA TCT TCA CCT CAG CCA GCA 967 
Gin Asn His Glu Pro Val Ala Thr Ser Pro Ser Ser Pro Gin Pro Ala 
215 220 225 230 

GAA ACC CAC CCT ACG ACA CTG CAG GGA GCA ATA AGG AGA GAA CCC ACC 1015 
Glu Thr His Pro Thr Thr Leu Gin Gly Ala He Arg Arg Glu Pro Thr 
235 240 245 

AGC TCA CCA TTG TAC TCT TAC ACA ACA GAT GGG AAT GAC ACC GTG ACA 1063 
Ser Ser Pro Leu Tyr Ser Tyr Thr Thr Asp Gly Asn Asp Thr Val Thr 
250 255 260 

GAG TCT TCA GAT GGC CTT TGG AAT AAC AAT CAA ACT CAA CTG TTC CTA 1111 
Glu Ser Ser Asp Gly Leu Trp Asn Asn Asn Gin Thr Gin Leu Phe Leu 
265 270 275 

GAA CAT AGT CTA CTG ACG GCC AAT ACC ACT AAA GGA ATC TAT GCT GGA 1159 
Glu His Ser Leu Leu Thr Ala Asn Thr Thr Lys Gly He Tyr Ala Gly 
280 285 290 

GTC TGT ATT TCT GTC TTG GTG CTT CTT GCT CTT TTG GGT GTC ATC ATT 1207 
Val Cys He Ser Val Leu Val Leu Leu Ala Leu Leu Gly Val He He 
29 5 300 305 310 

GCC AAA AAG TAT TTC TTC AAA AAG GAG GTT CAA CAA CTA AGA CCC CAT 1255 
Ala Lys Lys Tyr Phe Phe Lys Lys Glu Val Gin Gin Leu Arg Pro His 
315 320 325 



AAA TCC TGT ATA CAT CAA AGA GAA TAGTCCCTGG AAACATAGCA AATGAACTTC 
Lys Ser Cys He His Gin Arg Glu 
330 



1309 
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TATCTTGGCC ATCACAGCTG TCCAGAAGAG GGGAATCTGT CTTAAAAACC AGCAAATCCA 1369 

ACGTGAGACT TCATTTGGAA GCATTGTATG ATTATCTCTT GTTTCTATGT TATACTTCCA 1429 

AATGTTGCAT TTCCTATGTT TTCCAAAGGT TTCAAATCGT GGGTTTTTAT TTCCTCCGTG 1489 

GGGAAACAAA GTGAGTCTAA CTCACAGGTT TAGCTGTTTT CTCATAACTC TGGAAATGTG 1549 

ATGCATTAAG TACTGGATCT CTGAATTGGG GTAGCTGTTT TACCAGTTAA AGAGCCTACA 1609 

ATAGTATGGA ACACATAGAC ACCAGGGGAA GAAAATCATT TGCGAGGTGA TTTAACATAT 1669 

TTATGCAATT TTTTTTTTTT TTTTTGAGAT GGAGCTTTGC TCTTGTTGCC CAGGCTGGAG 1729 

TGCGATGGTG AAATCTCGGC TCACTGTAAC CTCCACCTTC CGGGTTCAAG CAATTCTCCC 1789 

GTCGAC 1795 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met His Pro Gin Val Val He Leu Ser Leu He Leu His Leu Ala Asp 
15 10 15 

Ser Val Ala Gly Ser Val Lys Val Gly Gly Glu Ala Gly Pro Ser Val 
20 25 30 

Thr Leu Pro Cya His Tyr Ser Gly Ala Val Thr Ser Met Cys Trp Asn 
35 40 45 

Arg Gly Ser Cys Ser Leu Phe Thr Cys Gin Asn Gly He Val Trp Thr 
50 55 60 

Asn Gly Thr His Val Thr Tyr Arg Lys Asp Thr Arg Tyr Lys Leu Leu 
65 70 75 80 

Gly Asp Leu Ser Arg Arg Asp Val Ser Leu Thr He Glu Asn Thr Ala 
85 * 90 95 

Val Ser Asp Ser Gly Val Tyr Cys Cys Arg Val Glu His Arg Gly Trp 
100 105 110 

Phe Asn Asp Met Lya He Thr Val Ser Leu Glu He Val Pro Pro Lys 
115 120 125 
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Val Thr Thr Thr Pro lie Val Thr Thr Val Pro Thr Val Thr Thr Val 
130 135 140 

Arg Thr Ser Thr Thr Val Pro Thr Thr Thr Thr Val Pro Thr Thr Thr 
l « 150 155 160 

Val Pro Thr Thr Met Ser He Pro Thr Thr Thr Thr Val Pro Thr Thr 
165 170 175 

Met Thr Val Ser Thr Thr Thr Ser Val Pro Thr Thr Thr Ser He Pro 
180 185 190 

Thr Thr Thr Ser Val Pro Val Thr Thr Thr Val Ser Thr Phe Val Pro 
195 200 205 

Pro Met Pro Leu Pro Arg Gin Asn His Glu Pro Val Ala Thr Ser Pro 
210 215 220 

Ser Ser Pro Gin Pro Ala Glu Thr His Pro Thr Thr Leu Gin Gly Ala 
225 230 235 240 

He Arg Arg Glu Pro Thr Ser Ser Pro Leu Tyr Ser Tyr Thr Thr Asp 
2 45 250 255 

Gly Asn Asp Thr Val Thr Glu Ser Ser Asp Gly Leu Trp Asn Asn Asn 
260 265 270 

Gin Thr Gin Leu Phe Leu Glu His Ser Leu Leu Thr Ala Asn Thr Thr 
2 75 280 285 

Lys Gly He Tyr Ala Gly Val Cys He Ser Val Leu Val Leu Leu Ala 
290 295 300 

Leu Leu Gly Val He He Ala Lys Lys Tyr Phe Phe Lys Lys Glu Val 
305 310 315 320 



Gin Gin Leu Arg Pro His Lys Ser Cys He His Gin Arg Glu 
325 330 
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What is claimed is: 



1 1 . A purified and isolated DNA molecule having a nucleotide sequence set forth in SEQ ID 

2 NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. 

1 2. A purified and isolated DNA molecule selected from among: 

2 a) the DNA molecule of SEQ ID NO: 1 or its complementary strand; 

3 b) the DNA molecule of SEQ ID NO:2 or its complementary strand; 

4 c) the DNA molecule of SEQ ID NO:4 or its complementary strand; 

5 d) the DNA molecule of SEQ ID NO:6 or its complementary strand; 

6 e) DNA molecules which hybridize under stringent conditions to the DNA molecule 

7 defined in a), b), c) or d), or fragments thereof; 

8 f) DNA molecules which, but for the degeneracy of the genetic code, would hybridize to 

9 the DNA molecule defined in a), b), c), d) or e). 

1 3. The recombinant DNA molecule according to claim 1 or 2, operably linked to an 

2 expression control sequence. 

1 4. A vector comprising a purified and isolated DNA molecule having a nucleotide sequence 

2 set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6. 

1 5. A biologically functional plasmid or viral DNA vector comprising a DNA molecule 

2 according to one of claims 1, 2 or 3. 

1 6. A prokaryotic or eukaryotic host cell stably transformed or transfected by a vector 

2 comprising a DNA molecule of claim 1 . 



1 

2 



7. A process for the production of a polypeptide product encoded by a DNA molecule 
according to claim 1 , 2 or 3, said process comprising: 
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3 growing, under suitable culture conditions, prokaryotic or eukaiyotic host cells transformed 

4 or transfected with the DNA molecule in a manner allowing expression of the DNA 

5 molecule, and recovering the polypeptide product of said expression. 

1 8. A polypeptide product produced by the process of claim 7. 

1 9. A protein having an amino acid sequence which comprises SEQ ID NO:3, SEQ ID NO:5 

2 orSEQIDNO:7. 

1 1 0. A purified and isolated protein encoded by the DNA of SEQ ID NO: 1 , SEQ ID NO:2, 

2 SEQ ID NO:4 or SEQ ID NO:6. 

1 11. The protein of claim 9 or 1 0, substantially free of other human proteins. 

1 12. A protein which is a variant of SEQ ID NO:3, SEQ ID NO:5 or SEQ ID NO:7. 

1 13. A soluble variant of the protein according to claim 9, 10, 1 1 or 12. 

1 14. An IgG fusion protein comprising the protein of claim 9, 10, 1 1, 12 or 13. 

1 15. The soluble protein of claim 1 3 , fused to a toxin, imageable compound or radionuclide. 

1 16. A specific monoclonal antibody to a protein of claim 9, 10, 1 1 or 12. 

1 1 7. The antibody of claim 1 6, associated with a toxin, imageable compound or radionuclide. 

1 1 8. A hybridoma cell line which produces a specific antibody to the protein of claim 9, 1 0, 

2 11, 12 or 13. 

1 1 9. An antibody produced by a hybridoma of claim 1 8. 
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1 20. A pharmaceutical composition comprising a therapeutically effective amount of the 

2 protein of claim 9, 10, 1 1, 12, 13, 14 or 15, and further comprising a pharmacologically 

3 acceptable carrier. 

1 21. A pharmaceutical composition comprising a therapeutically effective amount of the 

2 antibody of claim 16, 17 or 19, and further comprising a pharmacologically acceptable carrier. 

1 22. A method of treating a subject with renal disease, comprising administering to the 

2 subject a therapeutically effective amount of the protein of claim 9, 10, 1 1, 12, 13, 14 or 15. 

1 23. A method of treating a subject with renal disease, comprising administering to the 

2 subject a therapeutically effective amount of the antibody of claim 16, 17 or 19. 

1 24. A method of treating a subject with renal disease, comprising administering to the 

2 subject a therapeutically effective amount of the pharmaceutical composition of claim 20. 

1 25. A method of promoting growth of new tissue in a subject, comprising administering to 

2 the subject a therapeutically effective amount of the protein of claim 9, 10, 1 1, 12, 13 or 14. 

1 26. The method of claim 25, wherein the tissue is renal tissue. 

1 27. A method of promoting survival of damaged tissue in a subject, comprising 

2 administering to the subject a therapeutically effective amount of the protein of claim 9, 1 0, 1 1 , 

3 12, 13 or 14. 

1 28. The method of claim 27, wherein the tissue is renal tissue. 

1 29. A method of treating a subject with renal disease, comprising administering to the 

2 subject a therapeutically effective amount of the antibody of claim 16, 17 or 19. 
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1 30. A method of treating a subject with renal disease, comprising administering to the 

2 subject a therapeutically effective amount of the pharmaceutical composition of claim 21 . 

1 3 1 . A method of promoting growth of new tissue in a subject, comprising administering to 

2 the subject a therapeutically effective amount of the antibody of claim 16, 1 7 or 1 9. 

1 32. A method of promoting survival of damaged tissue in a subject, comprising 

2 administering to the subject a therapeutically effective amount of the antibody of claim 1 6, 1 7 or 

3 19. 

1 33. A method of treating a subject with a renal disorder, comprising administering to the 

2 subject a vector of claim 4 or 5. 

1 34. A method of promoting growth of new tissue in a subject, comprising administering to 

2 the subject a vector of claim 4 or 5. 

1 35. A method of promoting survival of damaged tissue in a subject, comprising 

2 administering a therapeutically effective amount of a vector of claim 4 or 5 to the subject 

1 36. The method of claim 34 or 35, wherein the tissue is renal tissue. 



1 37. A method for targeting an imageable compound to a cell expressing a protein of SEQ 

2 ID NO:3, SEQ ID NO:5 or SEQ ID NO:7, comprising contacting the cell with a monoclonal 

3 antibody of claim 1 6 fused to an imageable compound. 

1 38. The method of claim 37, wherein the cell is within a subject, and the monoclonal 

2 antibody is administered to the subject. 

1 39. A method of identifying damage or regeneration of renal cells in a subject, comprising 

2 comparing level of expression of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO:6 
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3 in renal cells of the subject to a control level of expression of SEQ ID NO: 1, SEQ ID NO:2, SEQ 

4 ID NO:4 or SEQ ID NO:6 in control renal cells. 

1 40. A method of identifying upregulation of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:4 

2 or SEQ ID NO:6 in cells comprising contacting the cells with an anti sense probe and measuring 

3 hybridization to RNA within the cell. 

1 41 . A method of identifying damage or regeneration of renal cells in a subject, comprising 

2 comparing concentration of SEQ ID NO:3, SEQ ID NO:5 or SEQ ID NO:7 in renal cells, renal 

3 cell fragments or body fluids of the subject to a control level of expression of SEQ ID NO:3, 

4 SEQ ID NO:5 or SEQ ID NO:7 in control renal cells. 

1 42. The method of claim 41, wherein the fluid is urine or serum. 

1 43. The method of claim 41, wherein the renal cells or renal cell fragments are obtained 

2 from urine sediment of the subject. 
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1 GCGGCCGCGTCGACGGTGCCTGTGAGTAAATAGATCAGGGTCTCCTTCAC 5 0 
51 AGCACATTCTCCAGGAAGCCGAGCAAACATTAGTGCTATTTTACCCAGGA 100 
101 GGAAATCTAGGTGTAGAGAGCTCTACGGATCTAAGGTTTGGATCTGTACC 150 

• • ■ 4 , 

151 CAGTGCTTTTTTAGGTGTCTTTAGACATTTCTCAGGAAGATGTAGTCTCT 200 

201 GTCACCATGTGTGGCTGAATTCTAGCTCAGTCCATCTTATTGTGTTTAAG 250 

251 GTAGTTGAAGTTTAGGAACCAACCAGTATGTCTCTGAGCAGAAGAGTACA 300 

3 01 GTGTCCATCTTGAGGACAAGCTCATCTTTACCATTAGAGGGCTGGCCTTG 350 

351 GCTTAGATTCTACCGAGAACATACTCTCTAATGGCTGCCCTCAGTTTTCT 400 

• * • • 

401 CTGTTTGCTGTCTTATTTGTGTCATGGCCAGAAGTCATATGGATGGCTCT 4 50 

4 51 ATGTGAGCAAGGACCCAGATAGAAGAGTGTATTTGGGGGAACAGGTTGCC 500 

501 CTAACAGAGAGTCCTGTGGGATTCATGCAGTCAGGATGAAGACCTGATCA 5 50 

• • ■ ■ . 

551 GACAGAGTGTGCTGAGTGCCACGGCTAACCAGAGTGACTTGTCACTGTCC 600 

• • • 

601 TTCAGGTCAACACCATGGTTCAACTTCAAGTCTTCATTTCAGGCCTCCTG 650 

MVQLQVFISGLL 

* 

651 CTGCTTCTTCCAGGCTCTGTAGATTCTTATGAAGTAGTGAAGGGGGTGGT 700 
LLLPGSVDSYEVVKGVV 

701 GGGTCACCCTGTCACAATTCCATGTACTTACTCAACACGTGGAGGAATCA 750 
GHPVTI PCTYSTRGGI T 

751 CAACGACATGTTGGGGCCGGGGGCAATGCCCATATTCTAGTTGTCAAAAT 800 
TTCWGRGQCPYSSCQN 

• * * • 

801 ATACTTATTTGGACCAATGGATACCAAGTCACCTATCGGAGCAGCGGTCG 850 
I LIWTNGYQVTYRSSGR 

851 ATACAACATAAAGGGGCGTATTTCAGAAGGAGACGTATCCTTGACAATAG 900 
YNIKGRISEGDVSLTIE 

901 AGAACTCTGTTGATAGTGATAGTGGTCTGTATTGTTGCCGAGTGGAGATT 950 
NSVDSDSGLYCCRVE I 

951 CCTGGATGGTTCAACGATCAGAAAATGACCTTTTCATTGGAAGTTAAACC 1000 
PGWFNDQKMTFSLEVKP 

1001 AGAAATTCCCACAAGTCCTCCAACAAGACCCACAACTACAAGACCCACAA 1050 
EIPTSPPTRPTTTRPTT 

• * » • 

1051 CCACAAGGCCCACAACTATTTCAACAAGATCCACACATGTACCAACATCA 1100 
TRPTT I STRSTHVPTS 
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1101 ACCAGAGTCTCCACCTCTACTCC7VACACCAGAACAAACACAGACTCACAA 1150 
TRVSTSTPTPEQTQTHK 

• • . • . 

1151 ACCAGAAATCACTACATTTTATGCCCATGAGACAACTGCTGAGGTGACAG 1200 
PE ITTFYAHETTAEVTE 

• • • • 

1201 AAACTCCATCATATACTCCTGCAGACTGGAATGGCACTGTGACATCCTCA 1250 
TPSYTPADWNGTVTSS 

1251 GAGGAGGCCTGGAATAATCACACTGTAAGAATCCCTTTGAGGAAGCCGCA 1300 
EEAWNNHTVRI PLRKPQ 

13 01 GAGAAACCCGACTAAGGGCTTCTATGTTGGCATGTCCGTTGCAGCCCTGC 1350 
RNPTKGFYVGMSVAALL 

1351 TGCTGCTGCTGCTTGCGAGCACCGTGGTTGTCACCAGGTACATCATTATA 1400 
LLLLASTVVVTRYI I I 

• • • • . 

1401 AGAAAGAAGATGGGCTCTCTGAGCTTTGTTGCCTTCC ATGTCTCTAAGAG 1450 
RKKMGSLSFVAFHVSKS 

1451 TAGAGCTTTGCAGAACGCAGCGATTGTGCATCCCCGAGCTGAAGACAACA 1500 
RALQNAAIVHPRAEDNI 

• • • • • 

1501 TCTACATTATTGAAGATAGATCTCGAGGTGCAGAATGAGTCCCAGAGGCC 1 550 
Y I IEDRSRGAE 

• • * * . 

1551 TTCTGTGGGGCCTTCTGCCTGGGATTACAGAGATCGTGACTGATTTCACA 1600 

1601 GAGTAAAATACCCATTCCAGCTCCTGGGAGATTTTGTGTTTTGGTTCTTC * 1650 
1 6 51 CAGCTGCAGTGGAGAGGGTAACCCTCTACCCTGTATATGCAAAACTCGAG 1700 

1701 GTTAACATCATCCTAATTCTTGTATCAGCAACACCTCAGTGTCTCCACTC 1750 

• ■ • » • 

1751 ACTGCAGCGATTCTCTCAAATGTGAACATTTTAGAAGTTTGTGTTTCCTT 1800 

1801 TTGTCCATGTAATCATTGGTAATAC^GAATTTTATCTTGTTTATTAAAA 1850 

1851 CCATTAATGAGAGGGGAATAGGAATTAAAAGCTGGTGGGAAGGGCCTCCT 1900 

1901 GAATTTAGAAGCACTTCATGATTGTGTTTATCTCTTTTATTGTAATTTGA 1950 

1951 AATGTTACTTCTATCCTTCCCAAGGGGCAAAATCATGGGAGCATGGAGGT 2 00 0 

2001 TTTAATTGCCCTCATAGATAAGTAGAAGAAGAGAGTCTAATGCCACCAAT 2050 

2051 AGAGGTGGTTATGCTTTCTCACAGCTCTGGAAATATGATCATTTATTATG 2100 

2101 CAGTTGATCTTAGGATGAGGATGGGTTTCTTAGGAGGAGAGGTTACCATG 2150 

• " • • • 

2151 GTGAGTGGACCAGGCACACATCAGGGGAAGAAAACAATGGATCAAGGGAT 2200 

• * * * • 

2201 TGAGTTCATTAGAGCCATTTCCACTCCACTTCTGTCTTGATGCTCAGTGT 2250 
2251 TCCTAAACTCACCCACTGAGCTCTGAATTAGGTGCAGGGAGGAGACGTGC 2300 

FIG. 1b 
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2301 


AGAAACGAAAGAGGAAAGAAAGGAGAGAGAGCAGGACACAGGCTTTCTGC 


2350 


2351 


TGAGAGAAGTCCTATTGCAGGTGTGACAGTGTTTGGGACTACCACGGGTT 


2400 


2401 


TCCTTCAGACTTCTAAGTTTCTAAATCACTATCATGTGATCATATTTATT 


JV 


2451 


TTTAAAATTATTTCAGAAAGACACCACATTTTCAATAATAAATCAGTTTG 


2500 


2501 


TCACAATTAATAAAATATTTTGTTTGCTAAGAAGTAAAAAAAAAAAAAAA 


2550 


2551 


AAGTCGACGCGGCCGC 2566 






FIG. 1c 
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1 GCGGCCGCGTCGACGGTGCCTGTGAGTAAATAGATCAGGGTCTCCTTCAC 5 0 

5 1 AGCACATTCTCCAGGAAGCCGAGCAAACATTAGTGCTATTTTACCCAGGA 1 0 a 

101 GGAAATCTAGGTGTAGAGAGCTCTACGGATCTAAGGTCAACACCATGGTT 150 

M V 

151 CAACTTCAAGTCTTCATTTCAGGCCTCCTGCTGCTTCTTCCAGGCTCTGT 200 
QLQVFISGLLLLLPGSV 

201 AGATTCTT ATGAAGTAGTGAAGGGGGTGGTGGGTCACCCTGTCACAATTC 250 
DSYEVVKGVVGHPVTI P 

251 CATGTACTTACTCAACACGTGGAGGAATCACAACGACATGTTGGGGCCGG 300 
CTYSTRGGITTTCWGR 

301 GGGCAATGCCCAT ATTCTAGTTGTCAAAATATACTTATTTGGACCAATGG 350 
GQCPYSSCQNILIWT'NG 

351 ATACCAAGTCACCTATCGGAGCAGCGGTCGATACAACATAAAGGGGCGTA 400 
YQVTYRSSGRYN I KGR I 

• • • • . 

401 TTTCAGAAGGAGACGTATCCTTGACAATAGAGAACTCTGTTGATAGTGAT 450 
SEGDVSLTIENS-VDSD 

451 AGTGGTCTGTATTGTTGCCGAGTGGAGATTCCTGGATGGTTCAACGATCA 500 
SGLYCCRVEIPGWFNDQ 

501 GAAAATGACCTTTTCATTGGAAGTTAAACCAGAAATTCCCACAAGTCCTC 550 
KMTFSLEVKPEIPTSPP 

551 CAACAAGACCCACAACTACAAGACCCACAACCACAAGGCCCACAACTATT 600 
TRPTTTRPTTTRPTTI 

601 TCAACAAGATCCACACATGTACCAACATCAACCAGAGTCTCCACCTCTAC 650 
STRSTHVPTSTRVSTST 

651 TCCAACACCAGAACAAACACAGACTCACAAACCAGAAATCACTACATTTT 700 
P T P E Q T Q T H K P E I T T F Y 

701 ATGCCCATGAGACAACTGCTGAGGTGACAGAAACTCCATCATATACTCCT 750 
AHETTAEVTETPSYTP 

751 GCAGACTGGAATGGCACTGTGACATCCTCAGAGGAGGCCTGG7VATAATCA 800 
ADWNGTVTSSEEAWNNH 

801 CACTGTAAGAATCCCTTTGAGGAAGCCGCAGAGAAACCCGACTAAGGGCT 850 
TVRIPLRKPQRNPTKGF 

• • • • • 

851 TCTATGTTGGCATGTCCGTTGCAGCCCTGCTGCTGCTGCTGCTTGCGAGC 900 
YVGMSVAALLLLLLAS 

901 ACCGTGGTTGTCACCAGGTACATCATTATAAGAAAGAAGATGGGCTCTCT 950 
TVVVTRYI I IRKKMGSL 
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951 GAGCTTTGTTGCCTTCCATGTCTCTAAGAGTAGAGCTTTGCAGAACGCAG 1000 
S FVAFHVSKSRALQNAA 

1001 CGATTGTGCATCCCCGAGCTGAAGACAACATCTACATTATTGAAGATAGA 1050 
IVHPRAEDNIYI IEDR 

1051 TCTCGAGGTGCAGAATGAGTCCCAGAGGCCTTCTGTGGGGCCTTCTGCCT 1100 
S R G A E 

1101 GGGATTACAGAGATCGTGACTGATTTCACAGAGTAAAATACCCATTCCAG 1150 

1151 CTCCTGGGAGATTTTGTGTTTTGGTTCTTCCAGCTGCAGTGGAGAGGGTA 1200 

1201 ACCCTCTACCCTGTATATGCAAAACTCGAGGTTAACATCATCCTAATTCT 1250 

1251 TGTATCAGCAACACCTCAGTGTCTCCACTCACTGCAGCGATTCTCTCAAA 1300 

13 01 TGTGAACATTTTAGAAGTTTGTGTTTCCTTTTGTCCATGTAATCATTGGT 1350 

13 51 AATACAAGAATTTTATCTTGTTTATTAAAACCATTAATGAGAGGGGAATA 1400 

■ * * * > 

14 01 GGAATTAAAAGCTGGTGGGAAGGGCCTCCTGAATTTAGAAGCACTTCATG 14 50 

1451 ATTGTGTTTATCTCTTTTATTGTAATTTGAAATGTTACTTCTATCCTTCC 1500 

1501 CAAGGGGCAAAATCATGGGAGCATGGAGGTTTTAATTGCCCTCATAGATA 1550 

1551 AGTAGAAGAAGAGAGTCTAATGCCACCAATAGAGGTGGTTATGCTTTCTC 1600 

1601 ACAGCTCTGGAAATATGATCATTTATTATGCAGTTGATCTTAGGATGAGG 1650 

1651 ATGGGTTTCTTAGGAGGAGAGGTTACCATGGTGAGTGGACCAGGCACACA 1700 

1701 TCAGGGGAAGAAAACAATGGATCAAGGGATTGAGTTCATTAGAGCCATTT 1750 

1751 CCACTCCACTTCTGTCTTGATGCTCAGTGTTCCTAAACTCACCCACTGAG 1800 

1801 CTCTGAATTAGGTGCAGGGAGGAGACGTGCAGAAACGAAAGAGGAAAGAA 1850 

1851 AGGAGAGAGAGCAGGACACAGGCTTTCTGCTGAGAGAAGTCCTATTGCAG 1900 

1901 GTGTGACAGTGTTTGGGACTACCACGGGTTTCCTTCAGACTTCTAAGTTT 1950 

1951 CTAAATCACTATCATGTGATCATATTTATTTTTAAAATTATTTCAGAAAG 2000 

2001 ACACCACATTTTCAATAATAAATCAGTTTGTCACAATTAATAAAATATTT 2050 
2051 TGTTTGCTAAGAAGTAAAAAGTCGACGCGGCCGC 2084 
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1 GCGGCCGCGTCGACTCGCAGGAGGCCGGCACTCTGACTCCTGGTGGATGG 5 0 

5 1 GACTAGGGAGTCAGAGTCAAGCCCTGACTGGCTGAGGGCGGGCGCTCCGA 100 

101 GTCAGCATGGAAAGTCTCTGCGGGGTCCTGGTATTTCTGCTGCTGGCTGC 150 
MESLCGVLVFLLLAA 

151 AGGACTGCCGCTCCAGGCGGCCAAGCGGTTCCGTGATGTGCTGGGCCATG 200 
GLPLQAAKRFRDVLGHE 

201 AGCAGTATCCGGATCACATGAGGGAGAACAACCAATTACGTGGCTGGTCT 250 
QYPDHMRENNQLRGWS 

251 TCAGATGAAAATGAATGGGATGAACAGCTGTATCCAGTGTGGAGGAGGGG 3 00 
SDENEWDEQ LYPVWRRG 

301 AGAGGGCAGATGGAAGGACTCCTGGGAAGGAGGCCGTGTGCAGGCAGCCC 350 
EG RWKDSWE GGRVQAAL 

351 TAACCAGTGATTCACCGGCCTTGGTGGGTTCCAATATCACCTTCGTAGTG 400 
TSDSPALVGSNITFVV 

401 AACCTGGTGTTCCCCAGATGCCAGAAGGAAGATGCCAACGGCAATATCGT 450 
NLVF PR CQ KEDANGN I V 

451 CTATGAGAGGAACTGCAGAAGTGATTTGGAGCTGGCTTCTGACCCGTATG 500 
YERNCRSDLELASDPYV 

501 TCTACAACTGGACCACAGGGGCAGACGATGAGGACTGGGAAGACAGCACC 550 
YNWTTGADDEDWEDST 

551 AGCCAAGGCCAGCACCTCAGGTTCCCCGACGGGAAGCCCTTCCCTCGCCC 600 
SQGQHLRFPDGKPFPRP 

6 01 CCACGGACGGAAGAAATGGAACTTCGTCTACGTCTTCCACACACTTGGTC 650 
HGRKKWNFVYVFHTLGQ 

651 AGTATTTTCAAAAGCTGGGTCGGTGTTCAGCACGAGTTTCTATAAACACA 700 
YFQKLGRCSARVS IN T 

701 GTCAACTTGACAGTTGGCCCTCAGGTCATGGAAGTGATTGTCTTTCGAAG 750 
VNLTVGPQVMEVIVFRR 

751 ACACGGCCGGGCATACATTCCCATCTCCAAAGTGAAAGACGTGTATGTGA 800 
HGRAYI PISKVKDVYVI 

801 TAACAGATCAGATCCCTATATTCGTGACCATGTACCAGAAGAATGACCGG 850 
TDQI P I FVTMYQKNDR 

851 AACTCGTCTGATGAAACCTTCCTCAGAGACCTCCCCATTTTCTTCGATGT 900 
NSSDETFLRDLPIFFDV 

901 CCTCATTCACGATCCCAGTCATTTCCTCAACTACTCTGCCATTTCCTACA 950 
LIHDPSHFLNYSAISYK 
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951 AGTGGAACTTTGGGGACAACACTGGCCTGTTTGTCTCCAACAATCACACT 1000 
WNFGDNTGLFVSNNHT 

1001 TTGAATCACACGTATGTGCTCAATGGAACCTTCAACTTTAACCTCACCGT 1050 
LNHTYVLNGTFNFNLTV 

1051 GCAAACTGCAGTGCCGGGACCATGCCCCTCACCCACACCTTCGCCTTCTT 1100 
QTAVPGPCPSPTPSPSS 

1101 CTTCGACTTCTCCTTCGCCTGCATCTTCGCCTTCACCCACATTATCAACA 1150 
STSPSPASSPSPTLST 

• • • 

1151 CCTAGTCCCTCTTTAATGCCTACTGGCCACAAATCCATGGAGCTGAGTGA 1200 
PS PSLMPT GHKSMELSD 

1201 CATTTCCAATGAAAACTGCCGAATAAACAGATATGGTTACTTCAGAGCCA 1250 
ISNENCRINRYGYFRAT 

1251 CCATCACAATTGTAGATGGAATCCTAGAAGTCAACATCATCCAGGTAGCA 1300 
ITIVDGILEVNI'IQVA 

• * • • 

13 01 gatgtcccaatccccacaccgcagcctgacaactcactgatggacttcat 1350 
dvpiptpqp'dnslmdfi 

1351 tgtgacctgcaaaggggccactcccacggaagcctgtacgatcatctctg 1400 
vtckgatpteacti isd 

1401 accccacctgccagatcgcccagaacagggtgtgcagcccggtggctgtg 14 50 
ptcq iaqnrvcspvav 

1451 gatgagctgtgcctcctgtccgtgaggag agccttcaatgggtccggcac 1500 
delcllsvrrafngsgt 

1501 gtactgtgtgaatttcactctgggagacgatgcaagcctggccctcacca 1550 
ycvnftlgddaslalts 

1551 gcgccctgatctctatccctggcaaagacctaggctcccctctgagaaca 1600 
alis i pgkdlgsplrt 

1601 gtgaatggtgtcctgatctccattggctgcctggccatgtttgtcaccat 1650 
vngvlisigclamfvtm 

1651 ggttaccatcttgctgtacaaaaaacacaagacgtacaagccaataggaa 1700 
vtillykkhktykp i gn 

1701 actgcaccaggaacgtggtcaagggcaaaggcctgagtgtttttctcagc 1750 
ctrnvvkgkglsvfls 

1751 catgcaaaagccccgttctcccgaggagaccgggagaaggatccactgct 1800 
hakapfsrgdrekdpll 

■ • • » . 

1801 CCAGGACAAGCCATGGATGCTCTAAGTCTTCACTCTCACTTCTGACTGGG 1850 
Q D K P W M L 

1851 AACCCACTCTTCTGTGCATGTATGTGAGCTGTGCAGAAGTACATGACTGG 1900 

FIG. 3b 
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1901 


TAGCTGTTGTTTTCTACGGATTATTGTAAAATGTATATCATGGTTTAGGG 


1950 


1951 


AGCGTAGTTAATTGGCATTTTAGTGAAGGGATGGGAAGACAGTATTTCTT 


2000 


2001 


CACATCTGTATTGTGGTTTTTATACTGTTAATAGGGTGGGCACATTGTGT 


2050 


2051 


CTGAAGGGGGAGGGGGAGGTCACTGCTACTTAAGGTCCTAGGTTAACTGG 


2100 


2101 


GAGAGGATGCCCCAGGCTCCTTAGATTTCTACACAAGATGTGCCTGAACC 


2150 


2151 


CAGCTAGTCCTGACCTAAAGGCCATGCTTCATCAACTCTATCTCAGCTCA 


2200 


2201 


TTG7UVCATACCTGAGCACCTGATGGAATTATAATGGAACCAAGCTTGTTG 


2250 


2251 


TATGGTGTGTGTGTGTACATAAGATACTCATTAAAAAGACAGTCTATTAA 


2300 


2301 


AAA 2303 
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1 ATGCATCCTCAAGTGGTCATCTTAAGCCTCATCCTACATCTGGCAGATTC 5 0 
MHPQVVILSLILHLADS 

* • • > . 

5 1 TGTAGCTGGTTCTGTAAAGGTTGGTGGAGAGGCAGGTCCATCTGTCACAC 100 
VAGSVKVGGEAGPSVTL 

101 TACCCTGCCACTACAGTGGAGCTGTCACATCAATGTGCTGGAATAGAGGC 150 
P CHYSGAVTSMCWNRG 

151 TCATGTTCTCTATTCACATGCCAAAATGGCATTGTCTGGACCAATGGAAC 200 
SCSLFTCQNGIVWTNGT 

201 CCACGTCACCTATCGGAAGGACACACGCTATAAGCTATTGGGGGACCTTT 250 
H-VTYRKDTRYKLLGDLS 

251 CAAGAAGGGATGTCTCTTTGACCATAGAAAATACAGCTGTGTCTGACAGT 300 
RRDVSLTIENTAVSDS 

* • • • . 

301 GGCGTATATTGTTGCCGTGTTGAGCACCGTGGGTGGTTCAATGACATGAA 350 
GVYCCRVEHRGWFNDMK 

* • • * 

351 AATCACCGTATCATTGGAGATTGTGCCACCCAAGGTCACGACTACTCCAA 400 
ITVSLE IVPPKVTTTPI 

401 TTGTCACAACTGTTCCAACCGTCACGACTGTTCGAACGAGCACCACTGTT 450 
VTTVPTVTTVRTSTTV 

4 51 CCAACGACAACGACTGTTCCAACGACAACTGTTCCAACAACAATGAGCAT 500 
PTTTTVPTTTVP.TTMSI 

501 TCCAACG ACAACGACTGTTCCGACGACAATGACTGTTTCAACGACAACGA 550 
PTTTTVPTTMTVSTTTS 

551 GCGTTCCAACGACAACGAGCATTCCAACAACAACAAGTGTTCCAGTGACA 600 
VPTTTSI PTTTSVPVT 

601 ACAACGGTCTCTACCTTTGTTCCTCCAATGCCTTTGCCCAGGCAGAACCA 650 
TTVSTFVPPMPLPRQNH 

651 TGAACCAGTAGCCACTTCACCATCTTCACCTCAGCCAGCAGAAACCCACC 700 
EPVATSPSSPQPAETHP 

701 CTACGACACTGCAGGGAGCAATAAGGAGAGAACCCACCAGCTCACCATTG 750 
TTLQGAIRREPTSSPL 

751 TACTCTTACACAACAGATGGGAATGACACCGTGACAGAGTCTTCAG ATGG 8 00 
YSYTTDGNDTVTESSDG 

801 CCTTTGGAATAACAATCAAACTCAACTGTTCCTAGAACATAGTCTACTGA 850 
LWNNNQTQLFLEHSLLT 

851 CGGCCAATACCACTAAAGGAATCTATGCTGGAGTCTGTATTTCTGTCTTG 900 
ANTTKG I YAGVCISVL 

FIG. 4a 



SUBSTITUTE SHEET (RULE 26) 



WO 97/44460 



10/11 



PCT/US97/09303 
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